Universities challenged


The accelerating pace of change in today’s world means that universities must modify how they 
fulfil their function of seeking and sharing knowledge. 


What is a university? To Shelby Foote, the US novelist and Civil
War historian, it was merely “a group of buildings gathered
around a library”. To generations of students, it provided the
best times of their lives. To many Nature readers, it is an employer. For
Nature itself, many are customers. A university, to linguists, is a derivation
of a Latin description of a community of teachers and scholars.

For more than 1,000 years, that crucial sense of community has 
endured. There is natural synergy between education — the transfer 
of knowledge — and research — the creation of knowledge. It makes 
sense for teachers and scholars to sit next to each other; better still, for 
them to be the same person. 

Universities have always changed with the times. But there is a 
growing sense that the pace of that change is accelerating. More funda- 
mentally, universities are losing control of the process. Change is being 
forced on them. The community of teachers and scholars will surely 
endure; it is too powerful to ignore. But the form that that community 
takes could change profoundly. Whatever a university looks like today, 
it seems certain that the universities of 2030 will look very different. 

This special issue of Nature tackles the matter head-on. And it does 
so by reporting on several experiments taking place at universities 
across the world, which are examining new approaches to both teach- 
ing and scholarship. This is an international concern, and the model, 
funding and operation of universities vary considerably from country 
to country. That makes it difficult to generalize about possible solu- 
tions. But, to a greater or lesser extent, all universities are buffeted by 
external forces from the same three directions. Two of these have been 
gathering strength for decades, and the other is just getting under way. 

First, universities are educating more people. Their original 
expansion just over a century ago saw them broaden from training the 
aristocracy in the classics to offering a range of professional schools for 
law, medicine and science. In recent decades, the expansion has been in 
the number and type of student. A series of welcome social changes has 
brought access to a university education to a larger and more diverse 
spectrum of people. This must all be paid for somehow. 

The second shift is out of the ivory tower. Universities are no longer 
viewed predominantly as places driven by curiosity and a thirst 
for knowledge. Instead, they are drivers of economic development. 
Success is defined by graduate employment. Research is problem- 
oriented. Scientists are budding entrepreneurs. Knowledge is included 
on the balance sheet, and policies are being introduced to produce the 
greatest return on investment. 

The newest challenge is a reassessment of how learning and innova- 
tion happen, and how they should be delivered. From the rise of online 
tuition, such as massive open online courses (MOOCs), to shifting 
attitudes to the value of traditional one-to-many lectures, to the erosion 
of the classic lone-genius model of research, the very foundations of 
the centuries-old university concept are under attack as never before. 

As discussed in this week’s collection of News Features and
Comment articles (see page 287 and nature.com/universities),
universities are finding their own ways to respond to these pressures. 
Appropriately, given the subject matter, there has been much academic 
thought and discussion about the direction that universities should 
take, and the possible pitfalls that lie along the way. 

From publishing and sport to finance and retail, commentators are 
queuing up to find parallels between higher education and other sectors 
that have undergone rapid change driven by the rise of technology or 
by restless political and social circumstance. Consultants warn that 
spiralling costs will see many universities go bankrupt, and web zealots
claim that there is no need for future physical meetings between teacher
and pupil.

Ultimately, there are two possible universities of the future. There is the
theoretical one: the institution devised in the abstract by selecting the
most-innovative technologies and most-appealing ideas and packaging
them together. This is the future of flying cars
and Mars colonies. It may come to pass, but 
it is hard to see how. Then there is the university of the future that 
remains firmly rooted in the university of the present and the past; a 
place where students, teachers and scholars gather to share and seek 
information, and where both the information and the process that 
uncovers it have value. 

Not all existing universities may have such a future. The present is 
moving too fast for a definitive idea about the practices and institutional 
structures they will need to survive. And so the only sensible strategy is 
for them to do what science has always done: experiment and see what 
works. As we show in these pages, that process is already under way.




Dust to dust 


What lessons can be learned from the 
presentation of the gravitational-waves story? 


More than six months after the initial announcement that
scientists had found evidence of gravitational waves — echoes
of the Big Bang itself — the claim is hanging by a thread.
Subsequent analysis showed that much of the signal could have been 
contaminated by galactic dust. The predictions of Nobel prizes for the 
team have faded. The champagne has gone flat. 
Extraordinary claims, as the saying almost goes, demand more scru- 
tiny than usual to make sure they stand up. That is how science works. 
Claim and counter-claim: intellectual thrust and experimental parry.
The tale of the gravitational waves has some way to rumble on yet.
Next week, at a meeting in Columbus, Ohio, organized by the Council for
the Advancement of Science Writing, a panel of scientists and journalists
will search for “lessons learned by scientists and science writers
involved with the BICEP2” story. What will these be?

The first thing to highlight is that such a thing as the Council for 
the Advancement of Science Writing even exists. Too many scientists 
dismiss the media and journalists as sloppy and unwilling to engage 
in both detail and ambiguity. In fact, there can be no branch of jour- 
nalism as self-scrutinizing and anxious about its performance as that 
which covers science. It is hard to imagine political and sports report- 
ers taking the time to discuss so thoroughly what (if anything) they 
did wrong after one of their stories went belly-up. 

The (welcome) rise of the science blogger has fuelled this navel- 
gazing. Some bloggers seem to spend most of their time criticizing 
other science writers, or at least debunking examples of what they 
regard as inferior science writing. But they do lots of good stuff too. 
Although traditionalists lament the decline of science coverage in the 
mainstream press, a terrific amount of analysis and comment, much 
of it very technical, is happening online under their noses. 

Nature has a stake in discussions of the gravitational-waves story. 
Our news team was among those tipped off about the claim in 
advance. We were proud of our (extensive) coverage, both in print 
and online, at the time. We remain so now. Like most other news 
organizations, we reported the claims from the provisional paper 
accurately and, like almost all the coverage, were sure to include the 
caveat that the findings would need to be confirmed. That is not to 
claim that the press can be given a free pass on this. Its job is to ask 
questions after all. But it is not always possible for journalists — even 
the best science writers — to provide the answers. 


What about the promised lessons for scientists? As we have pointed 
out before, researchers must not be afraid to be wrong. With hindsight 
they may feel they rushed to publish their claim too quickly, but pro- 
fessional science is a competitive and fast-moving field. The academic 
paper was cautious and the team’s reaction to subsequent criticism 
seems constructive. Some may question the timing of the announce- 
ment, made when the paper was released on the Internet, not accepted 
or published by a journal, but at least the evidence was there to examine.
If the scientists and the media both largely acted properly,
then what should be discussed at next week’s meeting? It could do worse
than start by screening the celebratory online video produced by
California’s Stanford University and released to accompany the announcement.
Scientists and journalists can include as many academic caveats as they 
like, but the sounds and images of champagne corks popping tend to 
render such statements of caution just that — academic. 

There is a deeper issue here: science not by press conference but 
presented as an event. What in reality is a long, messy and convoluted 
process of three steps forward and two steps back is too easily presented 
as giant leaps between states of confusion and blinding revelation. At the 
heart of this theatre is the artificial landmark of a peer-reviewed paper. 
Fixed print schedules and releases to journalists under embargo (with or 
without champagne videos) help to lend the impression that the publication
of a paper is the final word on a question — the end-of-term report
on a scientific project that details all that was achieved.

As BICEP2 clearly demonstrates, most science is a work in progress. 
Which is surely good news for scientists, who remain useful, and for 
science writers, who will always have something to cover.


Review rewards 


Welcome efforts are being made to recognize 
academics who give up their time to peer review. 


How many manuscripts is it reasonable for a scientist to peer
review in a year? Many researchers would estimate two or
three dozen; Malcolm Jobling, a fish biologist at the University
of Tromsø in Norway, says that he has racked up more than 125 already
this year. How do we know? A welcome movement is under way to
publicly register and recognize the hitherto invisible efforts of referees.

Jobling’s staggering total is revealed at Publons, a New Zealand-based
start-up firm that encourages researchers to post their peer-review
histories online (for an interview, see Nature http://doi.org/wbp;
2014). Publons is not the only attempt to recognize and reward
academics for their refereeing activity. As Nature noted last year (see
Nature 493, 5; 2013), publishers are increasing their efforts to reward
assiduous reviewers. The Nature journals give a free subscription to
anyone who has refereed three or more papers in a year for them,
and allow peer reviewers to download a statement of work. Similarly,
science publisher Elsevier this year launched a system to formally
recognize its peer reviewers, and to give rewards to ‘outstanding
reviewers’ — those who have reviewed the most papers.

Unlike Publons, which hopes to establish a cross-publisher profile, the
activities of individual publishers are restricted to their own platforms.
But publishers are taking part in broader talks to establish standards
to publicly record peer-review service in a researcher’s ORCID (Open
Researcher and Contributor ID) profile. Those discussions, under the
auspices of the Consortia Advancing Standards in Research Administration
(CASRAI), an international non-profit group, are also looking at
ways to record other types of peer review — including reviews of grant
applications, conference abstracts, service as a journal editor and
institutional benchmarking (for example, being on the panel of a national
research audit such as the UK Research Excellence Framework). 

Researchers could use their reviewer records to highlight their 
expertise for employers and government agencies. If enough informa- 
tion can be publicly revealed, it could shed more light on the average 
number and type of review undertaken by scientists, who increasingly 
complain that they are overwhelmed with peer-review requests. 

The final direction of the drive to publicly record and reward peer 
review is far from clear. Publons — among others — hopes that there 
will be more cases of open, signed reviews (which will make it easy to 
recognize a referee's contribution). Yet the majority of pre-publication 
reviews remain private: many researchers are uncomfortable about 
being publicly revealed as the author of a critical review because of the
fear of subtle reprisals in other areas of their career. Unless this culture 
shifts, efforts will stay focused on allotting credit for reviews whose 
text and author remain secret. 

Recording the number of reviews is only the start. A well-considered 
review that substantially improves a paper can take days — whereas a 
sloppy reviewer could dash off assessments of many papers in a few 
hours. So the next challenge in publicly recognizing peer review will 
be to find a way to assess quality. Many journal editors already have an 
informal idea of their ‘good’ and ‘bad’ reviewers, which in some cases 
can be quantified by response time. But these judgements are not usually 
shared with colleagues, and may differ from one editor to another. Lutz 
Prechelt, an informatics researcher at the Free University of Berlin who 
is advising Elsevier on its programme, has suggested that both authors 
and editors could be asked to mark the helpfulness and timeliness of a 
review. But it will be important to ensure that the benefits of this system 
are not drowned by the bureaucracy involved. 

Efforts to publicly recognize peer review are 
still in their infancy. But as attempts to acknowl- 
edge and reward a crucial role, they should be 
applauded.

WORLD VIEW A personal take on events


How terror-proof is your economy?


Scientists can help to develop a financial safety net by providing transparent
market data and loss-impact analysis, says Erwann Michel-Kerjan.


As aircraft from the United States and other nations continue to
bomb parts of the self-proclaimed Islamic State (commonly
known as ISIS), the long-term effects of the offensive remain
unclear. What is extremely worrying, however, is that ISIS has more
financial power — thanks to seized oil production and black markets —
than al-Qaeda had even when it perpetrated the 11 September
2001 attacks against the United States.

Back then, private insurers paid out US$44 billion in claims, allowing
most of the affected businesses to bounce back and to protect jobs. But
after the shock of 9/11 — then the most costly disaster in the history of
insurance — insurers started to exclude terrorism from their policies,
so governments were forced to take on some of the risk.

Thirteen years later, terrorism risk insurance remains a peculiar,
yet crucial, business. Governments want their
citizens and corporations to have financial protection.
But, for security reasons, they are reluctant
to share information about a terror threat that is
dynamic in nature: terrorist organizations morph
and adapt to governments’ foreign policies and
countermeasures. Yet to decide on a cost-sharing
agreement, and to make sure that compensation
will be in place, all involved in the negotiations
must have some idea of the probable impact of a
possible terrorist act.

Most national insurance schemes have a threshold
at which the government intervenes. The
United States is currently debating this as part of
discussions on the renewal of its post-9/11 agreement,
the Terrorism Risk Insurance Act (TRIA), which is scheduled to
expire in December. Congress and the president should renew it.

The United States might be the most high-profile target for terrorists,
but the threat is international. Still, only 10 of the 34 countries in the
Organisation for Economic Co-operation and Development (OECD)
have established national public-private terrorism risk-sharing programmes
that are transparent and legally binding. This is short-sighted:
ex ante risk-sharing can considerably reduce the market and political
over-reaction after a large attack.

On 10 September, the day that President Barack Obama announced
the United States’ offensive against ISIS, I chaired an OECD conference
in Washington DC that gathered together the heads of these national
programmes, insurance executives and scientists from around the world
to discuss the status of these programmes and ways to improve them.

A key issue that emerged is the importance of determining the probable
maximum financial impact of a terrorist attack, and who should
foot the bill. Private insurers can cover losses of
a few billions of dollars without assistance. But if
losses are likely to reach $50 billion or $100 billion,
then they want government guarantees that
their liability will be capped at an acceptable level.


This is where scientists have a role. Through practical experience and 
theoretical models, researchers — from weapons specialists, chemists 
and engineers, to physicians, economists and psychologists — hold 
knowledge that can help each country with that analysis. 

By combining data such as infrastructure systems, working patterns, 
evacuation plans, health impacts, business interruption and recovery 
time with a range of plausible attack scenarios, one can predict eco- 
nomic loss. And by combining that with information on market condi- 
tions, it is possible to quantify how terrorism losses will be spread across 
different stakeholders under alternative national risk-sharing designs. 

The Wharton Risk Management Center’s ‘TRIA after 2014’ study,
which was undertaken with Risk Management Solutions and which I 
co-led, does that for the four largest cities in the United States. It found, 
for example, that the loss caused by a 1-tonne 
sarin chemical-agent attack would range from 
$9 billion in Houston, Texas, to $25 billion in 
New York. By comparison, a 10-tonne truck- 
bomb attack would cost $28 billion and $32 bil- 
lion, respectively. Under the current TRIA, the 
US federal government would not pay any of 
those damages — insurers and businesses would. 

Costs would be much higher for a 1-kilotonne 
nuclear-weapon attack: around $170 billion in 
Houston, $230 billion in Los Angeles, $340 
billion in Chicago and $550 billion in New York 
City. Losses of property and because of busi- 
ness interruption account for the majority of the 
cost, but workers’ compensation loss is large, too 
(because of the combination of blast effects, and thermal and nuclear 
radiation). 

Although there is a consensus that such a nuclear attack would be 
much harder for terrorists to perpetrate without being spotted by 
intelligence services, ISIS’s financial capability should not be under- 
estimated; nor should the possibility that it, or other organizations, 
will decide to adopt other modes of attacks, such as using cyberspace. 

With proper access to data on exposure and insurance markets, 
our methods could be applied to help any country to make informed 
decisions, whether or not it has a terrorism insurance programme. 
The OECD could provide a neutral platform for these discussions. 

This is a new global context. Of course, proper insurance coverage 
will not prevent the next large-scale terrorist attack. But it will help 
economies to become more terror-proof, bringing an ounce of stability 
into our turbulent world.


Erwann Michel-Kerjan is executive director of the Wharton Risk 
Management Center at the University of Pennsylvania in Philadelphia, 
and chairman of the OECD Board on Financial Management of 
Catastrophes, headquartered in France. 

e-mail: erwannmk@wharton.upenn.edu

RESEARCH HIGHLIGHTS Selections from the scientific literature


Benefits outweigh
clean-energy costs


Large-scale investments in
wind, solar and hydropower
could double the electricity
generated globally from these
sources by 2050 — with only
modest environmental costs.
Thomas Gibon of the
Norwegian University of
Science and Technology in
Trondheim and his colleagues
compared the environmental
impacts of low-carbon and
fossil-fuel-based power
generation over the entire life
cycle of these installations.
They found pollution from
the construction of renewable-
energy infrastructure is
ultimately small compared with
direct emissions from gas- and
coal-fired power plants, even if
a large amount of carbon from
these plants is later captured
and stored.
Proc. Natl Acad. Sci. USA
http://doi.org/v8d (2014)


Birds colour-match
their nests


Zebra finches seem to actively
camouflage their nests when
building them.

Many birds’ nests appear
camouflaged, but this could be
a serendipitous result of their
use of local materials. Ida Bailey
at the University of St Andrews,
UK, and her team let 20 male
zebra finches (Taeniopygia
guttata; pictured) choose
between two types of paper
strip when building their nests:
one matching the cage colour
and the other contrasting. Of
the birds, 14 predominantly
chose the colour that matched
the cage decor.

This is the first experimental
evidence that birds choose to
camouflage their nests, say the
authors.

The Auk 132, 11-15 (2015)


METEOROLOGY


Weather explains Asian glacier survival


Some glaciers in central Asia could be
weathering climate change better than those
in neighbouring mountain ranges because of
different seasonal weather patterns.
Geoscientists have puzzled over why the
glaciers of the Karakoram region (pictured)
have not receded as much as others nearby.
A team led by Sarah Kapnick of Princeton
University, New Jersey, compared about 30 years
of temperature and precipitation data up to 2007
with climate simulations covering 1861 to 2100.
The team found that Karakoram gets most of its
precipitation during winter. By contrast, nearby
ranges such as the central Himalayas experience
mainly summertime rains driven by monsoons.

This seasonal difference could be preventing
the Karakoram glaciers from shrinking, and
could even be causing some of the glacier
expansion seen there in the past several years.
Nature Geosci. http://doi.org/v9g (2014)


Plants inspire 
medical coating 


A coating for medical implants 
such as artificial heart valves 
could prevent blood-clot
formation — a common
problem in which blood
cells and proteins stick to the
surfaces of such devices.

To make the surfaces less
sticky, Donald Ingber of
Harvard University in Boston,
Massachusetts, and his team
adapted technology inspired by
the carnivorous pitcher plant,
which has a slick layer of water
that causes insects to slide into
the plant’s ‘mouth’.

The authors designed a
two-layer coating: the first
layer uses a perfluorocarbon
to bind to smooth surfaces,
and the second is a slippery
film of medical-grade liquid
perfluorocarbon. Tubing
coated with this material had
a lower build-up of clots and
microorganisms than uncoated
tubing when implanted in pigs.
The material could reduce the
need for anti-clotting drugs,
which can cause bleeding.
Nature Biotechnol. http://doi.org/v9j (2014)


A monkey model 
of Alzheimer’s 


The molecule that has been 
implicated in Alzheimer’s 
disease causes many hallmarks 
of the disorder in monkey 
brains, suggesting the
potential for a primate model
of the disease. 

Amyloid-β forms plaques
in the brains of people with 
Alzheimer’s. Fernanda De 
Felice at the Federal University 
of Rio de Janeiro, Douglas 
Munoz at Queen’s University 
in Kingston, Canada, and 
their colleagues injected small 
aggregates of amyloid-β into
the brains of macaques. They 
found that the molecule ended 
up in key cognitive centres, 
where they noticed many 
of the same changes seen in 
diseased brains, such as the 
loss of neuronal connections. 

Alzheimer’s research relies 
heavily on rodent models, 
and these findings could lead 
to the development of better 
animal models of the disease, 
the authors say. 

J. Neurosci. 34, 13629-13643 
(2014) 


BIOTECHNOLOGY 


Another try at gene 
therapy for SCID 


Gene therapy has cured 
children who have severe 
combined immunodeficiency 
(SCID), without so far causing 
cancer as previous treatment 
forms did. 

David Williams at Boston 
Children’s Hospital in 
Massachusetts, Alain Fischer 
of the Necker Hospital for 
Sick Children in Paris and 
their co-workers made a viral 
vector containing a corrected 
version of the mutated gene 
that otherwise hobbles the 
immune systems of children 
with SCID. Nine boys were 
treated; eight survived during 
the 1-3-year follow-up period, 
while one died of an infection 
that predated the treatment. 

The researchers deleted 
certain key sections of the viral 
vector’s DNA and found that
the virus did not insert itself as 
often into cancer genes in the 
patients’ genomes as earlier 
versions of the virus did. None 
of the boys has yet developed 
cancer, but the researchers note 
that only long-term monitoring 
will rule out that possibility. 

N. Engl. J. Med. 371, 1407-1417 
(2014) 


Cities will grow 
thirsty 


The number of large cities 
prone to insufficient water 
supplies could increase over the 
next 25 years — even without 
accounting for climate change. 

Julie Padowski and Steven 
Gorelick at Stanford University 
in California used projected 
urban population growth 
and increasing agricultural 
demands to assess changes 
in water needs. They focused 
on 71 cities around the world 
that depend on water from 
surface rivers or reservoirs, 
and estimate a 28% increase 
in the number of cities that 
will suffer supply vulnerability 
in 2040 compared with 2010. 
Among the most vulnerable 
are Ouagadougou, Burkina 
Faso; Guangzhou, China; and 
Dublin, Ireland. 

Redistributing water from 
agriculture and from other 
non-urban areas could mitigate 
water shortages, the duo says. 
Environ. Res. Lett. 9, 104004
(2014) 


Marine slime 
ferries parasite 


Sticky molecules found in 
aquatic ecosystems could 
help to transmit land-based 
pathogens to marine animals. 
Karen Shapiro at the 
University of California, Davis, 
and her colleagues added
varying levels of a gelatinous
compound, alginic acid, to 
seawater samples containing 
the parasite Toxoplasma gondii, 
which is carried by cats. They 
found that it increased the 
number of parasites stuck to 
marine aggregates, and that 
similarly sticky molecules also 
allow the parasite to adhere 
to kelp surfaces. Snails, which 
graze on kelp, ingested and 
accumulated the pathogen. 
Sea otters are known to eat 
snails, and this finding could 
explain why the mammals have 
been infected with T. gondii. 
Proc. R. Soc. B 281, 20141287 
(2014) 




SOCIAL SELECTION Popular articles on social media


Online fun with Nobel forecasts


As this year’s Nobel laureates were inundated with
congratulations online, the few researchers who correctly
guessed the winners also earned themselves a little kudos. For
example, Kate Jeffery, a neuroscientist at University College
London, correctly foretold on Twitter that her colleague John
O’Keefe would win the Nobel Prize in Physiology or Medicine
for work on the brain’s positional system.
In an interview, Jeffery said that she wasn’t just making
a casual prediction, but was actively rooting for her former
postdoc adviser. She also had reason to celebrate the other
two winners, May-Britt Moser and Edvard Moser of the Kavli
Institute for Systems Neuroscience in Norway. As a PhD
student, Jeffery worked in the same lab
as the Mosers when they were postdocs.
“It really has been a delight to see a
Nobel-prizewinning discovery unfold
from start to finish,” she said.

NATURE.COM
For more on popular papers:
go.nature.com/yhvxxd


NEUROTECHNOLOGY


Better control
over bionics


Two groups have developed
technologies for artificial
arms that give people finer
control over the limb than over
conventional prostheses.
Daniel Tan at the Louis
Stokes Veterans Affairs
Medical Center in Cleveland,
Ohio, and his colleagues
implanted electrodes in the
arm muscles of two people,
who each had a prosthetic arm
and hand. Pressure sensors
in the bionic fingers together
with the embedded electrodes,
which sent complex electrical
patterns to residual nerves in
the arm, enabled the subjects
to sense different types of
touch — such as tapping and
constant pressure — without
feeling the tingling caused by
previous devices. This allowed
them to handle delicate objects
such as cherries.

In a separate study, Max
Ortiz-Catalan at Chalmers
University of Technology in
Gothenburg, Sweden, and
his co-workers attached an
artificial arm (pictured) to a
man’s humerus bone, using the
implant to direct electrodes
to specific arm muscles.

The electrodes detected the
man’s intended movements
better than conventional
skin sensors, allowing for
more-precise control of the
prosthesis.

Sci. Transl. Med. 6, 257ra138;
257re6 (2014)

SEVEN DAYS


POLICY 


Economic contagion 
The World Bank reported on 
7 October that the regional 
economic impact of the Ebola 
virus across West Africa 
could reach US$32.6 billion 
by the end of 2015, if the 
outbreak is not quickly 
contained in Guinea, Liberia 
and Sierra Leone — the three
most affected countries.

The projections expand
on estimates published last
month (see go.nature.com/dbznpb),
and account for the
probable spread of the disease 
and economic consequences 
from the core nations to 
neighbouring countries with 
much larger economies. 


Environment report 


Canada must step up long- 
range planning on several 
crucial environmental fronts, 
according to a 7 October 
government report. Among 
other areas, environment 
commissioner Julie Gelfand 
highlighted shipping areas in 
the Canadian Arctic that are 
poorly charted, increasing the 
risk of accidents as marine 
traffic rises. The government 
also needs to outline more 
clearly how it plans to reduce 
Canada’s greenhouse-gas 
emissions, said the report, 
and how it will monitor the 
environmental impact of 
developing oil sands in Alberta. 


Tar sands squeak by


In a proposal released on
7 October, the European
Commission backed down
from plans to label fuels
derived from tar sands as
more polluting than other
fuels. Member states have
yet to approve the move —
although in 2012, they rejected
a proposal to restrict fuel from
tar sands that would have
reclassified its environmental
impact. The states were
concerned that the change
would rile Canada, which
has extensive tar sands and
holds the world’s second largest
oil reserves. See
go.nature.com/yakmur for more.


Wreck yields fresh booty


Researchers revisiting an ancient shipwreck off the Greek
island of Antikythera have rescued further treasures from the
massive ship’s remains, which are scattered over a much larger
area than previously thought. Divers from the Woods Hole
Oceanographic Institution in Massachusetts and the Ephorate
of Underwater Antiquities in Athens discovered the artefacts
during an excavation season that finished on 7 October. The
2,000-year-old site is best known for yielding an intricate
navigation contraption, the Antikythera Mechanism. Among
the latest finds are an ornate bed leg, an intact jug and a giant
bronze spear (pictured) thought to have been part of a statue.
See go.nature.com/odmwtp for more.


Fossil-fuel retreat 


The University of Glasgow has 
committed to withdrawing 

its investments from the 
fossil-fuel industry, becoming 
the first UK university to do 
so. The move comes after 
nearly a year of campaigning 
by the Glasgow University
Climate Action Society, a
student advocacy group. 

In an announcement on 

8 October, the university said 
that it would reallocate around 
£18 million (US$29 million) of 
investments over 10 years. 


Stifling science 
Media policies at Canada’s 
government science agencies 
fail to support open and 
timely communication 
between researchers and 

the public, says a report 


released on 8 October by 
Evidence for Democracy, an 
advocacy group in Ottawa. 
Of 16 departments studied, 
15 require approval for media 
interviews with scientists. 
Canada’s health agencies 
scored lowest for protecting 
scientists’ speech against 
political interference, allowing 
only approved spokespeople 
to talk to the media, and 
requiring all interviews to 

be monitored by a media- 
relations professional. 


Climate threat 

The US defence department 
has called climate change an 
immediate risk to national 
security, with the potential 

to exacerbate threats such 

as infectious disease and 
terrorism. In a report issued 
on 13 October, defence 
officials said that climbing 
global temperatures, rising sea 
levels and extreme weather 
events would worsen food and 
water shortages, pandemics 
and political instability around 
the world. The department 
will continue to integrate 
climate-change risks into its 
operations, including reviews 
of strategic locations for 
weapons stores and crucial 
supplies. 


FUNDING
Funds for big data 


The US National Institutes 
of Health announced on 

9 October US$32 million in 
grants to help to make large, 
complex biomedical data sets 
more accessible and more 
informative. This fiscal year’s 
awards, part of the Big Data 
to Knowledge initiative, are 
an initial investment that the 
agency expects to build up to 
a total of nearly $656 million 
over the next six years. The 
current funding will establish 
12 centres of excellence for 
big-data computing, and will 
support the development of 


BRETT SEYMOUR/RETURN TO ANTIKYTHERA
SOURCE: IEA


new data-mining approaches, 
software and data-science 
training programmes. 


Telescope tensions 


Protestors showed up in 

force at a 7 October ground- 
breaking ceremony for the 
Thirty Meter Telescope on 
Mauna Kea, Hawaii. Many 
Native Hawaiians are opposed 
to the construction because 
they consider Mauna Kea 

to be a sacred mountain.
Protestors blocked the road 

to the summit as scientists 
and officials made their way 
to a ceremony that included a 
Native Hawaiian blessing. The 
incident highlights ongoing 
tensions between astronomers 
and members of the local 
community who have long 
resented the large number 

of telescopes already on the 
mountain. 


RESEARCH

Sea tragedy

A Taiwanese research vessel capsized on 10 October, killing two
scientists. Ocean Researcher V sank in the Taiwan Strait
(pictured), one day into a voyage to study air pollutants.
Forty-three scientists and crew members were rescued, but
Hsu Shih-chieh, a researcher at Academia Sinica in Taipei, and
Lin Yi-chun, an engineer at the Taiwan Ocean Research Institute in
Kaohsiung City, died. An investigation into the cause of the
disaster is under way, and Taiwan's science ministry is considering
whether to replace the 1.46-billion-Taiwan-dollar (US$48-million)
vessel. See go.nature.com/qwjamr for more.


Methane hotspot

A 6,500-square-kilometre spot in the southwestern United States is
spewing the country’s largest concentration of methane emissions.
Located near the intersection of Arizona, Colorado, New Mexico and
Utah, the methane hotspot released about 0.59 million tonnes of
the potent greenhouse gas each year between 2003 and 2009,
according to an analysis of satellite data published on 9 October
that tripled previous ground-based estimates (E. A. Kort et al.
Geophys. Res. Lett. http://doi.org/v9f; 2014). The data pre-date
the widespread use of fracking in the area, leading the authors
to attribute the hotspot to leaks from more established methods of
fossil-fuel extraction and processing in New Mexico.


Record high sea-ice

The sea ice surrounding Antarctica hit a maximum this year of
20.11 million square kilometres, the US National Snow and Ice Data
Center in Boulder, Colorado, reported on 7 October. This is a
record high since satellite records began in 1979. Scientists are
unsure why the Antarctic sea ice has been growing so rapidly of
late, setting new records in both 2012 and 2013.


Space-weather hub

The United Kingdom opened its first space-weather forecasting
centre on 8 October. Based at the Met Office in Exeter, the centre
will predict how radiation, energetic particles and fluctuating
magnetic fields ejected from the Sun are likely to affect
technology on Earth, such as communications satellites and power
grids. The centre will operate around the clock, providing public
forecasts and early warning of threats to crucial infrastructure.
See go.nature.com/yuiplt for more.


TREND WATCH

Untapped natural gas and renewable energy sources in sub-Saharan
Africa should be exploited to boost prosperity and electricity
access, the Paris-based International Energy Agency noted in a
13 October report. The agency expects that by 2040, 950 million
people will gain electricity in the underserved region. But more
than half a billion people will still be in the dark, mainly in
rural areas, where small-scale ‘micro-grids’ will be needed
(see Nature 507, 154-156; 2014).


RISE OF RENEWABLES IN AFRICA

Renewables could make up 41% of sub-Saharan Africa's electricity
generation in 2040, says the International Energy Agency.

2012 total generation: 440 terawatt hours (TWh)
Coal 56%; hydro 22%; oil 9%; gas 9%; nuclear 3%; other renewables 1%.

2040 total generation: 1,540 TWh
Coal 27%; hydro 26%; gas 25%; solar 7%; oil 4%; nuclear 3%;
bioenergy 3%; geothermal 3%; wind 2%.
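
As a quick consistency check on the chart, the 2040 shares can be added up to see whether they give the 41% renewable figure in the caption. This is only a sketch: the mapping of percentages to fuels is read from the chart labels, the solar value is garbled in this copy and is assumed here to be 7% (the value that makes the shares total 100%), and treating hydro, wind, solar, bioenergy and geothermal as the "renewables" is an assumption about how the agency groups them.

```python
# Shares of sub-Saharan Africa's electricity generation in 2040, in percent,
# as read from the chart. ASSUMPTION: solar = 7 (garbled in this copy);
# it is the value that makes the shares sum to 100.
SHARES_2040 = {
    "coal": 27, "oil": 4, "hydro": 26, "wind": 2, "solar": 7,
    "gas": 25, "nuclear": 3, "bioenergy": 3, "geothermal": 3,
}

# ASSUMPTION: which sources count as renewable.
RENEWABLES = {"hydro", "wind", "solar", "bioenergy", "geothermal"}

TOTAL_2040_TWH = 1540  # total generation in 2040, from the chart


def renewable_share(shares):
    """Return the combined percentage share of the renewable sources."""
    return sum(pct for source, pct in shares.items() if source in RENEWABLES)


share = renewable_share(SHARES_2040)
print("all shares sum to:", sum(SHARES_2040.values()), "%")
print("renewable share:", share, "%")
print("renewable generation:", TOTAL_2040_TWH * share / 100, "TWh")
```

Under these assumptions the renewable share comes to 41%, matching the caption, which corresponds to a little over 630 TWh of the 1,540 TWh total.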




19 OCTOBER 

Comet Siding Spring 
sweeps within 

140,000 kilometres of 
Mars, giving researchers 
a rare chance to make 
up-close observations 
using Mars orbiters. 


Galileo mishap 

A frozen fuel line caused the 
botched launch that placed two 
satellites for Europe’s Galileo
global navigation system into a 
misshapen orbit on 22 August, 
according to an announcement 
on 8 October. An independent 
panel appointed by launch 
operator Arianespace, the 
European Space Agency (ESA) 
and the European Commission 
to investigate the failure 

found that the hydrazine fuel 
line for the rocket’s upper 

stage had been improperly 
clamped together with a cold 
helium line. The error caused 
the hydrazine line to freeze 

and prevented it from firing 
correctly. ESA is investigating 
whether the satellites can still 
be made useful for navigation 
(see Nature http://doi.org/v9w; 
2014). 


PEOPLE
PhD at stake 


The lead author of two 
high-profile stem-cell papers 
retracted from Nature in 

July could lose her PhD. 
Waseda University in Tokyo 
announced last week that it 
will revoke Haruko Obokata’s 
doctoral degree unless the 
biochemist corrects flaws 
found in her thesis within one 
year. In March 2014, Nature 
discovered that parts of the 
thesis had been taken from US 
National Institutes of Health 
materials. One image was also 
found to have been reproduced 
from a commercial website 
without attribution (see Nature 
507, 283; 2014).


16 OCTOBER 2014 | VOL 514 | NATURE | 279 


© 2014 Macmillan Publishers Limited. All rights reserved 


SUZANNE KREITER/BOSTON GLOBE/GETTY 


NEWS IN FOCUS


A research team led by Douglas Melton (left) has made insulin-secreting cells using human stem cells.


REGENERATIVE MEDICINE 


Stem-cell success 
aids diabetes fight 


Now the challenge is to protect cell transplants from the 
immune systems of people with type 1 diabetes. 


BY HEIDI LEDFORD 


Each year, surgeon Jose Oberholzer frees
a few people with type 1 diabetes from
daily insulin injections by giving them a
transplant of the insulin-secreting β-cells that
the disease attacks. But it is a frustrating process.
Harvested from a cadaver’s pancreas, the
β-cells are in short supply and vary in quality.
And the patients must take drugs to suppress
their immune response to the foreign cells,
which can in turn cause kidney failure.

On 9 October, stem-cell researcher Douglas 
Melton of Harvard University in Cambridge, 
Massachusetts, and his colleagues reported 
an advance that has the potential to overcome 
Oberholzer’s frustrations and allow many more 


people with type 1 diabetes to receive transplants.
Melton and his team have achieved a
long-term goal of stem-cell science: they have
created mature β-cells using human stem cells
that can be grown from a potentially unlimited
supply, and that behave like the real thing
(F. W. Pagliuca et al. Cell 159, 428-439; 2014).
The next challenge is to work out how to shield
these β-cells from the body’s immune response.

Researchers had previously created immature
β-cells from stem cells and transplanted
them into diabetic mice. But they take months
to mature into insulin-secreting cells, and it is
unclear whether they would do so in humans.

The β-cells reported by Melton’s team were
grown from adult cells that had been reprogrammed
to resemble stem cells. In response
to glucose, the β-cells quickly secreted insulin,
which the body uses to regulate blood sugar.
When implanted in diabetic mice, the cells
relieved symptoms within two weeks. The
β-cells even formed clusters that are similar
to those found in a pancreatic structure called
the islet of Langerhans. “If you took these cells
and showed them to somebody without telling
them what they are, I guarantee you an expert
would say that is a perfect human islet cell,”
says Oberholzer, who is working with Melton’s
team to test the cells in non-human primates.

A remaining hurdle is shielding the cells
from immune attack. This is necessary if the
treatment is to become more widely available,
because immunosuppressant drugs can be justified
only in the most severe cases of diabetes.
And although mature β-cells could be derived
from a patient’s own skin cells, type 1 diabetes
is an autoimmune disease, so transplanted cells
would still be vulnerable to attack.

One solution might be to encapsulate the
cells in a credit-card-sized, biocompatible
sheath made by ViaCyte of San Diego, California.
The company will implant its first
device — loaded with immature β-cells — in a
patient on 21 October. Studies in animals have
been promising, but some researchers worry
that the cells inside the device are packed too
densely and might become starved of oxygen
and nutrients.

Another option is to coat cells in a protective 
hydrogel, which results in thousands of sepa- 
rate balls of cells. But a potential drawback is 
that it would be much harder to remove such 
cells if there was a safety concern, says Albert 
Hwa, director of discovery science at JDRF, a 
diabetes-research foundation in New York. 

Neither technique avoids the body’s 
tendency to enclose foreign bodies inside scar 
tissue, which could cut the transplanted cells off 
from nutrients. Bioengineer Daniel Anderson 
of the Massachusetts Institute of Technology in 
Cambridge and his team are screening chemical 
compounds for a hydrogel that does not trigger 
this. Some, used with Melton’s cells, have shown 
promise in unpublished studies of diabetic 
primates, he says. 

Still, for those people with diabetes who
face life-threatening changes in blood-sugar
level each day, mature β-cells could offer a big
improvement without such devices, says Oberholzer.
Many of his patients are relieved to be
free of insulin injections: “They would much
rather take immunosuppression,” he says.




Vast stores of DNA samples and data have been produced by the increasing pace of genetic sequencing. 


Giant gene banks 
take on disease 


Researchers bring together troves of DNA sequences in the 
hope of teasing out links between traits and genetic variants. 


BY ERIKA CHECK HAYDEN 


Early last year, three researchers set out
to create one genetic data set to rule
them all. The trio wanted to assemble

the world’s most comprehensive catalogue of 
human genetic variation, a single reference 
database that would be useful to researchers 
hunting rare disease-causing genetic variants. 
Unlike past ‘big data’ projects, which have 
involved large groups of scientists, this one 
deliberately kept itself small, deploying just five 
analysts. Nearly two years in, it has identified 
about 50 million genetic variants — points at 
which one person’s DNA differs from another’s 
— in whole-genome sequence data collected 
by 23 other research collaborations. The group, 
called the Haplotype Reference Consortium, 
will unveil its database in San Diego, California,
on 20 October, at the annual meeting of the
American Society of Human Genetics.

Geneticists have not always been so willing 
to share data. But that seems to be changing. 
“It's been surprisingly easy to bring all these 
data sets together,” says Jonathan Marchini, 
a statistical geneticist at the University of 
Oxford, UK, and one of the consortium’s lead- 
ers. “There is a lot of goodwill between the 
people in the field; they all understand the 
benefits of doing this and have worked hard to 
make their data available.”

In the past five years, there has been an explo- 
sion in rates of sequencing human genomes 
thanks to the falling cost of the technology. At 
the same time, geneticists have realized that 
linking genes to diseases and traits will require 
much bigger sample sizes than any one research 
centre can assemble. 

It was once assumed that common diseases
and traits could be traced to a few common
genetic variants that would be relatively easy
to find. But that has turned out not to be the
case. It is now clear that thousands of different
variants each play a small part in determining
a person’s height or risk of schizophrenia, for
example. And finding those thousands of variants
means looking at a daunting number of
people. At the same time, the increased pace of
genetic sequencing has made it possible to collect
enough genomes to uncover those variants.

“Here are a bunch of data sets that individu- 
ally cost millions of dollars to generate, and you 
have people willing to make that data available 
to a shared resource, which is amazing,” says
geneticist Daniel MacArthur of Massachusetts 
General Hospital in Boston. 

MacArthur is part of the Exome Aggrega- 
tion Consortium, another attempt to create a 
supersized library of human genetic variation. 
On 20 October, MacArthur and his colleagues 
plan to unveil their own public database con- 
taining the protein-coding portions, or exomes, 
of 63,000 human genomes originally gathered 
by other researchers. “We can say from looking 
at a very large cohort of people ... this is what 
the distribution of rare variation looks like,” 
says MacArthur. “And that is very powerful.”

MacArthur is developing tools to comb the 
data for mutations that disable genes. Only 
some of these ‘loss-of-function’ mutations
cause harm; predicting which are pathogenic 
will require knowing more about which ones 
regularly occur in healthy people. 

Some studies are already reaping the benefits 
of huge data sets. On 5 October researchers 
published a paper on the genetics of height that 
included data on more than 250,000 people 
(A. R. Wood et al. Nature Genet. http://doi.org/ 
v6k; 2014). The data had been gathered in sep- 
arate genome-wide association studies, which 
look for correlations between genetic variants 
and traits or diseases, and pooled as part of the 
Genetic Investigation of Anthropometric Traits 
(GIANT) Consortium. The paper reported 
697 new variants linked to height, more than 
tripling the previous count. Still, researchers 
estimate that the hundreds of common vari- 
ants now identified account for only 16% of the 
genetic contributors to height. 

Throwing even more data into the pool 
could reveal some of the rest, says Joel 
Hirschhorn, a geneticist at the Broad Institute 
in Cambridge, Massachusetts, and a leader of 
the GIANT consortium.


African penguins put 
researchers in a flap 


Controlled fishing experiment raises controversy over cause 
of birds’ decline on Robben Island. 


BY MICHAEL CHERRY IN CAPE TOWN 


Robben Island is notorious as the site
where Nelson Mandela spent 18 years in

prison, but now the island’s 1,200 breed- 
ing pairs of African penguins are sparking a 
scientific controversy. At stake is the survival of 
an endangered species, as well as how fisheries 
around the world are managed. 

In 2013, there were just 22,000 breeding pairs 
of African penguins (Spheniscus demersus) 
worldwide: the population had declined by 65% 
since 2001. Possible causes include pollution, 
habitat loss and climate change, but a key sus- 
pect is fishing of anchovies and sardines, which 
are important prey for penguins. 

To test this theory, in 2008, the now-defunct 
South African Department of Environmental 
Affairs and Tourism began an unusual experi- 
ment involving two pairs of islands: Robben 
and Dassen islands, off South Africa’s west 
coast; and St Croix and Bird islands, off the 
country’s south coast. For three years, a zone 
around one island in each pair was closed to 
fishing while the other island remained open. 
Then the situation was reversed. The rare 
controlled experiment has “important impli- 
cations for fisheries worldwide in competition 
with vertebrate predators”, including seabirds,
seals and dolphins, says Johann Augustyn, sec- 
retary of the South African Deep-Sea Trawling 
Industry Association in Cape Town. 

When fishing was restricted around St Croix 
and Bird, penguins had to expend less energy on 
foraging to feed their chicks, so they and their 
chicks were more likely to survive (L. Pichegru 
et al. Biol. Conserv. 156, 117-125; 2012). The 
areas surrounding those islands will soon be 
closed to fishing permanently. But for Robben 
Island, no clear pattern has emerged. Scien- 
tists at the University of Cape Town’s Marine 
Research Institute, where most of the data from 
the experiment are analysed, are fiercely debat- 
ing whether the alternating closures of Robben 
and Dassen islands should continue. 

Those who want more time argue that it is 
hard to determine the effects of island closure 
on penguins, because natural variations in 
fish abundance can swamp the signal. Extend- 
ing the closures for another three years should 
allow a signal — if there is one — to emerge, says
Richard Sherley, an ornithologist at the institute. 


African penguins on Robben Island, South Africa. 


Marine ecologist Astrid Jarre agrees that 
shutting the fisheries for another cycle would 
be a sensible precaution. The restrictions have
led to small increases in chick survival, she says. 

But some scientists are calling for an end to 
the experiment, saying that the closures dam- 
age the livelihoods of people in the fishing 
industry. Doug Butterworth, director of the 
institute’s Marine Resource and Assessment 
Management Group, suggests that fishing may 
even help the penguins of Robben and Dassen 
islands, by breaking up shoals of anchovies and 
making the fish easier for penguins to catch. 

Population-modelling studies of both fish 
and penguins by William Robinson, a former 
PhD student in Butterworth’s group, flag up a 
problem that may be much worse for the pen- 
guins than fishing: many sardines are failing 
to reach maturity, and the distribution of the 
fish is shifting eastwards. The causes of these 
trends are not well understood. 

A workshop in Cape Town on 13-17 October, 
organized by BirdLife South Africa, is evalu- 
ating the evidence from the island-closure 
experiment. An international panel is expected 
to make a recommendation to the South Afri- 
can Department of Agriculture, Forestry and 
Fisheries in the first week of December.


The Ebola outbreak in West Africa continues to
rage, with the number of people infected roughly
doubling every 3-4 weeks. More than 8,000 people
are thought to have contracted the disease, and
almost half of those have died, according to the
World Health Organization. Although these
estimates are already staggering, the situation on
the ground means that not all cases and deaths
are being reported, so the true extent is likely to be
much greater.

Outside of Africa, a health-care worker in Texas
has become infected while treating a patient who
was hospitalized in Dallas after travelling from
Liberia and who has now died. And a nurse in
Madrid has contracted the virus after caring for a
missionary who had become infected while caring
for patients in West Africa. Health-care workers
remain one of the groups at highest risk of
exposure: by 8 October, 416 had become infected
and 233 had died.

The spread beyond the epicentre of Guinea,
Liberia and Sierra Leone remains limited. Apart
from the people in Dallas and Spain, only two
other exported cases are known: one in Nigeria
and one in Senegal. A man who travelled to Lagos
from Liberia sparked a further 19 cases in Nigeria,
but that outbreak was curtailed by the swift actions
of the authorities in tracing and monitoring those
who had contact with the infected man. Similar
public-health measures stopped further cases in
Senegal after an infected man travelled from
Guinea to Dakar.

Within the epicentre, authorities have made
some progress in slowing transmission — but the
disease is resurgent in places where it had seemed
under control, such as in Conakry, Guinea’s capital.

Meanwhile, the estimated cost of fighting the
disease is spiralling upward. UN secretary-general
Ban Ki-moon warned on 9 October that “at least a
20-fold surge in assistance” was needed to
confront the outbreak. But “things will get worse
before they get better”, he warned. Just how much
worse will depend on the international community
— which has been widely criticized for its belated
response, and its slow translation of pledges into
concrete action.


A RISING TOLL

The number of Ebola cases continues to rise because control
measures in the outbreak area are insufficient. But it could drop
quickly if the international community and affected countries
manage to implement an effective response.
(Chart legend: Liberia, Sierra Leone, Guinea.)


Ebola is spread by contact with an infected person's bodily fluids, but is less 
contagious than many common diseases, such as mumps and measles. In 
the current outbreak, each person with Ebola will infect 1-2 other people. 
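
To give a feel for what a doubling time of 3-4 weeks implies, the short sketch below applies plain exponential growth to the figure of roughly 8,000 infections quoted in this story. It is an arithmetic illustration only, not a forecast: it assumes growth continues unchecked, the time horizons are hypothetical, and the function name `projected_cases` is ours rather than anything used by the World Health Organization.

```python
def projected_cases(initial_cases: float, weeks_elapsed: float,
                    doubling_time_weeks: float) -> float:
    """Case count after weeks_elapsed of unchecked growth that doubles
    every doubling_time_weeks (simple exponential model)."""
    return initial_cases * 2 ** (weeks_elapsed / doubling_time_weeks)


REPORTED_CASES = 8000  # "more than 8,000 people", as quoted in the story

# ASSUMPTION: the horizons below are hypothetical; the story gives only
# the doubling time of 3-4 weeks.
for doubling_time in (3, 4):
    for weeks in (4, 8, 12):
        cases = projected_cases(REPORTED_CASES, weeks, doubling_time)
        print(f"doubling every {doubling_time} weeks, "
              f"after {weeks} weeks: about {cases:,.0f} cases")
```

The gap between the 3-week and 4-week scenarios widens quickly, which is one way to see why delays in the response make an outbreak so much harder to contain.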


Diseases compared: Ebola, SARS, HIV, mumps, smallpox, pertussis (whooping cough), measles.


GLOBAL REACH


Modelling of historical air-passenger volumes and flight networks can point to 
international destinations where a traveller with Ebola might end up. 


CLOSE CONTACTS 


Monitoring people who have had direct contact with someone with Ebola 
symptoms for 21 days — the incubation period — can help to prevent further 
transmission of the virus. But the contacts are difficult to trace in regions with a 
high density of cases, especially when resources are limited.

If outbreaks are allowed to grow, they become more difficult and costly to control. In April,
the World Health Organization estimated that it would cost US$4.8 million to contain the 
Ebola outbreak, but by September that figure had ballooned to almost $1 billion. Experts say 
that the total cost of ending this outbreak is likely to be higher still.


Chemical visionaries: (from left) Stefan Hell, Eric Betzig and William Moerner.


NOBEL PRIZE 


Insider view of 
cells scoops Nobel 


Optics pioneers win chemistry prize for defying limits of 


conventional microscopes. 


BY RICHARD VAN NOORDEN 


Ever since the seventeenth century, when
Dutch microbiologist Antonie van
Leeuwenhoek focused light through
lenses and marvelled at the cells that swam 
before his eyes, microscopes have been at the 
heart of discovery. This year, the Nobel Prize in 
Chemistry went to three scientists who defied 
the limits of light microscopes to reveal images 
of molecular-scale structures in living cells. 

The advances made by Stefan Hell, William 
Moerner and Eric Betzig in the 1990s and 2000s 
allow biologists to watch in real time how pro- 
teins are distributed and move inside cells — at 
the junctions between neurons, for example, or 
in fertilized eggs dividing into embryos. 

“It is really a revolution for the life sciences,
because we can see structures that we could
never see before,” says Stefan Jakobs, who
works with super-resolution techniques at the
Max Planck Institute for Biophysical Chemistry
in Göttingen. Or as the Nobel committee
put it: “Microscopy has become nanoscopy.”

No matter how clean their lenses, optical 
microscopes inevitably provide a blurry view 
of the molecules inside cells, as German physi- 
cist Ernst Abbe realized in 1873. The laws of 
physics dictate that visible light cannot dis- 
tinguish between objects closer to each other 
than around 200 nanometres (around half 
the wavelength of visible light) — they will 
appear as one blob. Such resolution, known 
as Abbe’s diffraction limit, is good enough to 


reveal the organelles inside cells but not to see 
their detailed structures. Microscopes that 
use beams of electrons, rather than light, have 
finer resolution, but they can be used only in a
vacuum, limiting their use to dead tissue. 
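
The ‘around half the wavelength’ rule of thumb can be written in the standard textbook form of Abbe’s formula, in which the smallest resolvable separation d depends on the wavelength λ and the numerical aperture NA of the objective. The article does not give this expression; the sample values below (green light at 550 nanometres and a high-end oil-immersion objective with NA of 1.4) are illustrative assumptions.

```latex
d = \frac{\lambda}{2\,\mathrm{NA}},
\qquad
\lambda = 550\ \mathrm{nm},\ \mathrm{NA} = 1.4
\;\Rightarrow\;
d = \frac{550\ \mathrm{nm}}{2 \times 1.4} \approx 196\ \mathrm{nm}
```

With these assumed values the limit comes out at roughly 200 nanometres, in line with the figure quoted above.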

Abbe’s limit cannot be overcome, but the 
2014 Nobel prizewinners pioneered ways to 
work around it using fluorophores, or fluores- 
cent molecules. Now routinely used in biologi- 
cal imaging, fluorophores emit light when hit 
by lasers of a certain wavelength. 

In 1989, William Moerner, now at Stanford 
University in California, but then at the IBM 
Almaden Research Center in San Jose, detected 
the faint fluorescence of a single molecule. In 
1997, while working at the University of Cali- 
fornia, San Diego, he found a way to control 
the fluorescence and switch the molecules on 
and off like lamps. Still, these single molecules
could be distinguished only if they were more 
than 200 nanometres apart. 

Two years earlier, Eric Betzig, then working 
at Bell Labs in Murray Hill, New Jersey, had 
proposed that if different molecules inside a 
cell could be made to glow with different col- 
ours, researchers should be able to increase the 
resolution by taking a series of snapshots — 
first the red molecules, then the green, then the 
blue. Any fluorophores of the same colour would
have to be more than 200 nanometres apart, but the
superimposed images would produce a much
finer-resolution structure. Moerner went on to
show that identical molecules could be made 
to fluoresce at different times, a discovery that 
ultimately made Betzig’s vision a reality. 

It was another decade before Betzig demon- 
strated his idea in practice. In 2006, working at 
the Howard Hughes Medical Institute's Janelia 
Farm research campus in Ashburn, Virginia, he 
took a super-resolution picture of a lysosome 
protein dotted with green fluorescent molecules 
as labels. The technique can now get down to a
resolution of 20 nanometres, says Markus Sauer,
who studies super-resolution microscopy at the
University of Würzburg, Germany.

Meanwhile Stefan Hell, working at the Uni- 
versity of Turku in Finland, had found a way 
around Abbe’s limit by a different technique,
which also relies on switching fluorescent molecules
on and off. In 1994, he proposed using
one laser to make a cluster of dye molecules fluoresce,
and a second beam, of a different wavelength,
to switch some of those fluorophores off.

Hell’s trick is to use the second beam to 
outline the cluster illuminated by the first, so 
that only the molecules in a very narrow spot 
fluoresce. The final image remains blurred, as 
light still cannot beat Abbe’s limit, but it is clear 
that light can have come only from the narrow 
central spot defined by the second beam, ena- 
bling researchers to pinpoint the light source. 

Building up a series of these tiny fluorescent 
spots creates a fine-resolution picture. In theory, 
the central spot can be made as small as a few 
nanometres across, but in living cells, the limit 
is around 30 nanometres, Sauer says, because 
it is at this stage that fluorophores are usually 
destroyed by the intensity of the second beam. 

“It was my view, at least, that so much physics
happened in the twentieth century that it
was impossible there was no phenomenon
that would allow you to overcome the diffraction
barrier,” Hell, who now works at the Max
Planck Institute for Biophysical Chemistry,
told the Nobel committee.

The techniques devised by the prizewinners 
are used by many biologists. Xiaowei Zhuang, 
a chemist at Harvard University in Cambridge, 
Massachusetts, has invented a variation called 
stochastic optical reconstruction microscopy, 
and has used it to show how filaments of the 
protein actin wrap around nerve cells. “There 
will be many new versions of super-resolution 
microscopes,” Hell says.


CORRECTION & CLARIFICATION 

The News story ‘Marmosets are stars of 
Japan’s ambitious brain project’ (Nature 
514, 151-152; 2014) misspelled Afonso 
Silva’s name. And the Toolbox story 
‘Scientific writing: the online cooperative’ 
(Nature 514, 127-128; 2014) should have 
noted that although Fidus Writer does not 
record the detailed history of every single 
edit, users can save time-stamped versions.


Universities must evolve if they are to survive. A special issue
of Nature examines the many ways to build a modern campus. 


When the first universities
emerged in eleventh-century 
Europe, their mission was 
education, scholarship and 
nothing else. They housed 
bright young clerics, study- 
ing the newly rediscovered works of ancient 
thinkers such as Aristotle and Euclid. Only in 
the nineteenth century, following the lead of 
Britain and Germany, did universities begin to 
give equal weight to a second mission: scien- 
tific research. 

But in the past few decades, universities 
around the world have begun to take on fur- 
ther missions. Today they are supposed to be 
not only centres of education and discovery, 
but also engines of economic growth, bea- 
cons of social justice and laboratories for new 
modes of learning. 

In the face of these sometimes conflicting 
requirements — not to mention financial pressure
from cash-strapped governments — today’s
universities are evolving and changing at an
unprecedented pace. In this special issue, Nature 
looks at some of the myriad ways in which uni- 
versities around the world are trying to free 
themselves from old habits of thought, and to 
explore new ways of doing things. 

One perennial issue is the departmental 
structure that keeps researchers mentally 
and physically separated. Two articles look at 
US attempts to tackle that problem: the first, 
on page 292, describes how Arizona State 
University in Tempe is aggressively promoting 
interdisciplinary centres; and the second, on 
page 297, discusses efforts to facilitate the com- 
mercialization of research by putting scientists 
from industry in the same buildings as their 
academic counterparts. 


THE UNIVERSITY EXPERIMENT 


A Nature special issue

A second challenge is the ivory-tower
mindset that leads faculty members to dis- 
dain commercial activity. A Comment (see 
page 295) reveals efforts in China to intro- 
duce a Western-style tenure system that will 
encourage innovation and risk-taking. Other 
countries are grappling with their own educa- 
tional legacies, and a News Feature explores 
some of the diverse efforts to institute change 
(see page 288). A South African university is 
attempting to overcome the legacy of apart- 
heid, for example, whereas one in South Korea 
is throwing out ineffective teaching methods 
such as mass lectures. There is plenty more 
content at nature.com/universities. 

No one knows which of these experiments 
will produce the best-educated students or 
the greatest leaps in academic understanding 
(see page 273). But all share the sentiment that 
the twenty-first-century university could be 
dramatically different from the institutions of 
the past.

CAMPUS AS LABORATORY


MODERN UNIVERSITIES ARE HEIRS TO A THOUSAND- 
year tradition of scholarship. But they are also being buffeted by 
twenty-first-century upheavals in technology, economics and soci- 
ety. Through trial, error and experiment, they are now trying to 
find new ways of thinking and acting that will help them to prosper. 


Innovative ways of teaching, 
learning and doing research 
are helping universities 
around the globe to adapt to 
the modern world.

When chemist Wolfgang Herrmann began his first term as
president of the Technical University of Munich (TUM) in
1995, he was determined to challenge an academic status
quo that had prevailed for more than two decades.

Germany had responded to the social upheaval of the 1960s by
declaring that all universities were equivalent and taking steps to
prevent the development of a privileged elite, a move that tended to undermine
any competitive spirit in the faculty. New rules had also guaranteed
a place for any student with a school-leaving certificate — which meant 
that universities had no say in who took their courses — and kept fac- 
ulty members bound to bureaucratic civil-service laws. The result was 
an inward-looking ivory-tower culture that had stagnated intellectually 
and financially. 

Herrmann’s vision was to turn the TUM into a nimbler, more inter- 
nationally competitive ‘entrepreneurial university’ that would encour- 
age innovation, risk-taking and business initiative among students and 
faculty members alike. To do that, he restructured the TUM along 
the lines of successful US institutions such as the Massachusetts Insti- 
tute of Technology (MIT) in Cambridge. In 1999, he made one of his 
first — and, within Germany, pioneering — reforms by installing a 
board of trustees that replaced the Bavarian education ministry’s direct 
control of the TUM and allowed for much quicker decision-making. 
Since then, he has used that freedom to introduce some of the first 
German graduate schools: institutions that provide PhD candidates 
with rigorous common standards for coursework, instead of leaving
them to the vagaries of individual supervisors. Herrmann has also created a
private fund-raising foundation to allow flexible and independent financing of
some university projects; formed an institute of advanced studies; and launched
a tenure-track system that obliges the university to promote and per- 
manently employ academics who make the grade, and sack those who 
do not. The latter system is a familiar concept in the United States, but 
revolutionary in Germany. 

The changes did not go down well at first with some faculty mem- 
bers, who were uncomfortable with the perceived emphasis on applied 
research and commercial pay-off at the expense of basic research. But 
the discontent has faded as the university's scholarly output has soared, 
from 2,276 publications in 2002 to 5,827 in 2013. The TUM’s funding 
from government agencies and industry — nearly €300 million (US$380 
million) this year — is among the highest in Germany. 

In 2012, Herrmann was re-elected to his post for his fourth consecu- 
tive six-year term by a university board that includes representatives 
from the faculty, students, non-academic staff and the surrounding 
community. He has announced that this term will be his last. But his 
unusually long tenure, which will total 24 years when he leaves in 2019, 
has given him time and clout to push the regional Bavarian government 
to relax one restriction on the TUM after another. “Now that I know 
virtually everyone in politics and government, they are sometimes afraid 
to say ‘no’ to me — because they know others will ask them why they are 
being uncooperative,” he says, only half-joking.

When the federal government introduced its Excellence Initiatives 
— competitions in 2006 and 2012 designed to encourage universities 
to actively shed their restrictions and win elite status (see Nature 487, 
519-521; 2012) — it gave other German universities an incentive to 
undertake reforms. But nowhere have those changes proceeded as rap- 
idly as at the TUM, which was a winner in both competitions. Bavaria 
has agreed to pay one-quarter of the running costs of the TUM’s Excel- 
lence Initiative projects when the federal money runs out in 2017. 

“This new culture is now ingrained,” says Herrmann. “The next generation
of leadership will continue in this vein.”


BY MARK ZASTROW 


Tae-Eog Lee has a simple philosophy about what academics should
do in lectures: anything but lecture. “Usually, in a conventional
classroom, students don't think,” he says. “They just follow the professor’s
teaching.”

So at the KAIST science and technology university in Daejeon, South
Korea, where Lee heads the Center for Excellence in Teaching and
Learning, he is working to implement a ‘flipped classroom’. Instead of
sitting through endless one-way lectures, students watch online lessons
at home, and then come to the classroom to discuss the concepts and
work on problems in small groups. Teaching assistants and the lecturer
are there to supervise — but most of the learning happens among the
students themselves. Lee calls this Education 3.0, and sees it as a way
to spark creativity, teamwork and the willingness to ask questions, all
of which are suppressed by the nature of lecturing — and, say many, by
South Korea's hierarchical society.

KAIST is not the first university to try out this concept, but strong
support from its administration has made it a leader in the flipped-classroom
movement in just two years. From 3 pilot classes in the spring
of 2012, the effort has grown to nearly 60 classes this autumn. And over
the next 3 years, Lee hopes to raise that to 800 classes, 30% of KAIST’s
total. Observers at institutions elsewhere are impressed by the magnitude
of KAIST’s efforts. “They’re changing culture at a massive scale,”
says Sanjay Sarma, director of digital learning at MIT. 

That is the sort of cultural shift that KAIST has been seeking since 
the early 2000s, when the Korean government began overhauling the 
university to compete in a globalized world. The reform effort took 
off in 2006, with the selection of Nam Pyo Suh, a Korean American 
mechanical engineer at MIT, as president of the university. Suh crafted 
an environmental and sustainability initiative that brought a wave of 
government funds and elicited private donations. This enabled the university
to go on a hiring spree, recruiting lots of young faculty members
— who in turn brought in grants. KAIST’s position in Times Higher 
Education’s global university rankings skyrocketed, from 198th at the 
beginning of Suh’s tenure to 69th three years later. 

But faculty members soon began to protest, striking out at Suh’s 
unsparing performance assessments and his insistence on instruction 
in English instead of Korean. Then, in early 2011, four students commit- 
ted suicide within three months, rocking the institution to its core. The 
tragedies put another of Suh’s reforms under scrutiny. In an attempt to 
raise both standards and funds, he had started levying the university’s 
first tuition charges — but only on students who earned poor grades. 
Those who succeeded academically would continue to pay nothing. 

Students say that the social stigma of paying off low grades amplified 
the already hyper-competitive environment of KAIST — and of South 
Korean society at large, which has the highest suicide rates in the devel- 
oped world. Facing calls to resign, Suh apologized, scrapped the fees 
and reinstated instruction in Korean. He also expedited the launch of 
Education 3.0, “in part because I wasn't sure how long I would be there”,
he says. (He was finally forced to resign in February 2013.)

Education 3.0’s flipped-classroom approach has been prospering.
An estimated 30% of KAIST’s 10,000-strong student body have taken 
an Education 3.0 course so far, and their test scores are at least as good 
as those of students in standard classes. Most important to Lee, how- 
ever, are the intangible benefits. For example: 71% of the Education 3.0 
students report an improved understanding of the material, increased 
motivation and better concentration. But a significant minority remain 
unconvinced. “Presentation and discussion are not familiar to Korean 
students,” says Seong Keun Kang, a graduate student in nuclear and 
quantum engineering. “I’m not sure it is better than the original classes.” 

Still, other universities are following KAIST’s lead. Seoul National 
University, one of South Korea's most prestigious institutions, intro- 
duced its first flipped classes this year. 

Universities throughout Asia are watching KAIST, says Gerard 
Postiglione, who studies Asian higher-education development at the 
University of Hong Kong in China. According to the QS World Uni- 
versity Rankings, KAIST is now the second-best university in Asia. 
Institutions “want to see: how do you do this, how do you rise so 
quickly?” says Postiglione. 


UK: THE SOCIAL UNIVERSITY 


BY ELIZABETH GIBNEY 


wave of massive open online courses (MOOCs): recorded lectures
that could be delivered on the web to tens or hundreds of thousands
of students around the world for free. Other institutions scrambled to 
follow suit, and the media filled with hype about how MOOCs would 
spark a total transformation of higher education. 

Mike Sharples took such rhetoric with a grain of salt. But he works 
at the Open University in Milton Keynes, UK, which has been deliver- 
ing courses to students around the world by post, television and com- 
puter for some 40 years — and the university was determined not to 
be outdone. By 2012, Sharples, chair of educational technology at the 
university, had joined with a team of fellow British academics to create 
next-generation MOOCs inspired by the work of the late Gordon Pask: 
a British educational psychologist who believed that students construct 
their knowledge through mutual interactions. The new MOOCs would 
put social engagement at the centre of learning, and encourage con- 
versations as intense as those in online games. “It was something of 
a gamble,” says Sharples. “It seems obvious in retrospect that people 
would want to talk about their learning, but it wasn’t obvious a year ago.” 

The first 36 of the new MOOCs were developed last year by various
partner institutions and offered through FutureLearn, a wholly owned 
subsidiary of the Open University. The catalogue has expanded greatly 
since then, and now ranges from Introduction to Forensic Science to 
England in the Time of King Richard III. The MOOCs enable discus- 
sions on every single piece of content, allowing users to ‘like’ comments 
or follow those posted by particular classmates, as in a standard social 
network, and even letting students assess each other’s work. The Future- 
Learn software is designed to work on tablets and mobile phones, as 
well as desktop or laptop computers. And the courses often include 
strong storytelling elements — a prime example being the forensic- 
science course, which was developed by the University of Strathclyde 
in Glasgow, UK, and which leads students through the material using 
an unfolding plot about a murder scene. 

FutureLearn now has 40 partners, 10 of them outside the United 
Kingdom. Data on its early courses show that some 22% of students 
who start a FutureLearn MOOC complete the majority of steps and 
all assessments. This figure drops to 12% when the count includes all
the students who enrol in a course but never start, but it still compares
favourably to other MOOCs, which average less than a 7% completion
rate. (Detailed comparisons are difficult, because each MOOC provider
has a different definition of ‘completion’.)

FutureLearn’s MOOCs also get high marks from outsiders such as 
Sally Mapstone, pro-vice-chancellor for education at the University of 
Oxford, UK. Although Oxford has elected not to join a MOOC platform
— and Mapstone has doubts about such courses’ potential to revolution- 
ize education — she says she does admire FutureLearn’s “simple and 
attractive” approach. 

In many ways, FutureLearn is still trailing the first wave of US MOOCs 
(see Nature 495, 160-163; 2013). It has more than 500,000 registered users 
and 130 courses — whereas leading MOOC company Coursera, founded 
in April 2012 by computer scientists at Stanford University in California, 
has almost 10 million registered users and more than 400 courses. Anant 
Agarwal, chief executive of edX, a MOOC provider in Cambridge, Massa- 
chusetts, that has around 3 million users, says that FutureLearn’s approach 
is creative. But his platform too is “evolving at a torrid pace”, he says, using
student feedback to improve how discussion forums and cohorts work. 

“We need to experiment a whole lot more with hundreds of courses 
and millions of users before generalizing” about what works best for the 
students, says Agarwal. 

And Sharples, for one, is eager to do just that. 


SOUTH AFRICA: THE INCLUSIVE UNIVERSITY 


BY LINDA NORDLING 


During most of South Africa’s apartheid era of strict racial segregation,
the country’s leading universities catered mostly to the white
elite. Shortly before the apartheid system was dismantled in the
early 1990s, however, the University of Cape Town (UCT) joined with
a number of other South African universities in reaching out to impoverished
students — the vast majority of whom were black.

The general idea behind the UCT’s programme has been to help students
from disadvantaged backgrounds to acquire the skills that their
wealthier contemporaries take for granted. It provides support including
language-development courses for those whose first language is
not English, instruction in good study habits and even psychological
counselling. It also includes group sessions that let students discuss challenges
ranging from how to manage their personal finances to ways to
cope with stress.

For science students, the UCT offers foundation courses in biology,
physics, chemistry and mathematics to patch any knowledge gaps. A
winter science programme runs trips to Cape Town’s aquarium and
nearby fossil parks, and provides other science-related experiences
that students may have missed while growing up. To make time for
these extra activities, the UCT’s Bachelor of Science programme gives
students the option of stretching the normal three-year undergraduate
curriculum to four years.

Since they were introduced in 1986, the UCT’s four-year undergraduate
courses have trained more than 2,000 students. Mokete Koago
was one: he enrolled in what was then known as General Entry to
Programmes in Science (GEPS) when he came to the UCT in 2008. A
bright student from a poor township in South Africa’s rural Free State
province, Koago found the extra time, tutoring and mentoring essential.
“I don’t think I would have made it through my degree without
GEPS,” he says.

The programme is still evolving. Until last year, for example, undergraduates
on science courses were channelled into three- or four-year
streams as soon as they enrolled. Now, all students start in the same
course. Only after six weeks do they choose whether to stay on the three-year
track or opt for the four-year Extended Degree Programme.

The idea, says David Gammon, a UCT chemist who serves as a senior 
advisor for the extended science programmes, is to let students’ paths at 
university be determined by their performance rather than where they 
went to school or the colour of their skin. This approach also means 
that students are actively involved in choosing their own paths — an 
important consideration given that there could be a stigma attached to 
joining the longer course. 

Transformation is proving slow. A report on undergraduate- 
curriculum reform published by South Africa’s Council on Higher 
Education in 2013 found that, although the fraction of the country’s 
black 20-24 year olds attending university has risen slightly, from 10% 
in 2005 to 14% in 2011, it is still dwarfed by the figure for white people: 
57%. And of those black students who do make it to university, only 
one in five completed their undergraduate degrees within four years, 
as opposed to 44% of white students. 

Still, there have been many individual successes. During Koago’s four 
years at UCT, for example, he discovered a passion for meteorology, 
climate and ocean science — an unexpected love for a boy who had 
grown up in the dusty interior of South Africa. “When my parents came 
down for my graduation, it was the first time in their lives that they saw 
the sea,” he says. He is now a research assistant in the UCT’s Climate
Systems Analysis Group, and he hopes to embark on a master’s degree 
in oceanography next year. 

Eventually, Koago hopes to bring his passion for science home — and 
perhaps to inspire other young people to follow in his footsteps. “I want 
to bridge the gap between people living in townships and the science,” he 
says. “The biggest problem out there is that people are ill-informed.”

Arizona State University is trying
to reinvent academia by tearing 
down walls between disciplines. 


BY JOSH FISCHMAN 


Worlds both familiar and strange come together inside a
glass-walled room at Arizona State University in
Tempe. Images of the Moon’s surface fill giant screens as 
planetary geologist Jim Bell shows off panoramas from one of the 
university's cameras, which is currently flying on a lunar orbiter. 
Bell, tall and enthusiastic, gets even more animated when he talks 
about plans to visit an odder place: an asteroid named Psyche made 
almost entirely of iron. Researchers are keen to explore it because it 
is essentially a naked version of Earth’s metallic core, something that 
scientists have never seen. 
Designing a mission to study a rapidly spinning hunk of iron more 
than 255 million kilometres from Earth calls for close collaboration 
between scientists and engineers. Bell finds that kind of coordination
easier at Arizona State University (ASU) than when he worked at
Cornell University in Ithaca, New York, on the Mars rovers. 

At Cornell, “the engineers were someplace else on campus”, he says.
“So you’d come up with an idea for an instrument, kind of toss it over
the wall, and then a year later they’d toss a design back to you that may
or may not work, scientifically.” But at ASU, Bell works at the School
of Earth and Space Exploration (SESE), which includes engineers and
computer scientists. “They are people who are interested in the same science
I’m interested in, and we get things done faster and, I think, better.”

The exploration school, formed in 2006 from the former departments 
of astronomy and geology, is the most striking embodiment of the ambi- 
tious vision of Michael Crow, who took over as president of ASU in 2002 
with the goal of turning a public university with a middling reputa- 
tion into something much greater. ASU was not known for exceptional 
scientific research, and attracted students mainly from within the state. 

Crow has sought to transform ASU’s research and education by tear- 
ing down walls between traditional academic departments and bringing 
together disparate disciplines to tackle large issues such as exploring 
the Solar System, finding alternative ways to attack cancer and solving 
problems that matter to Arizona as well as the rest of the world, such as 
severe water shortages. Crow has travelled extensively, talking up what 
he calls the “New American University” that is taking root in the desert.
“We're going to best serve our students, and the world, by preparing
them to tackle the big problems of the modern age,” he says. 

More than a decade into his tenure, the results are mixed. On the 
positive side, ASU has more than doubled the amount of federal money 
it attracts for research. And the culture at the university has shifted to 
make research and education more interdisciplinary. “I think some of 
the things Arizona is doing could have a real impact,” says Daniel Fisher,
a physicist at Bio-X, a multidisciplinary institute at Stanford University
in California. 

But seen from another perspective, the changes at Arizona are mod- 
est shifts — layering new institutes on top of traditional departments, 
for example. And the reinvention effort may not have substantially 
improved the quality of ASU’s research. An analysis of scholarly output 
conducted by Nature shows that ASU’s record has improved by some 
measures, such as the number of papers published, but the university 
has gained little ground compared with similar institutions. 

The results underscore how hard it is for large universities, which 
employ thousands of researchers, to alter their fundamental character 
by uprooting entrenched academic disciplines. Even Crow says that “the 
biggest challenge that we've had has been the strength of ‘the invisible’ 
colleges — the fact that people show more allegiance to their disciplines 
and the structure of those disciplines than to the institution they are a 
part of”. 


CHANGE AGENT 
Still, the signs of change are all over the university — literally. Big placards 
in hallways announce “A New American University” with eight ambitious
calls to action. “Fuse Intellectual Disciplines” is one, along with
“Transform Society”, “Value Entrepreneurship”, “Enable Student Success”,
and “Conduct Use-Inspired Research”. The campus itself has a modern,
utilitarian look: large buildings with clean lines, many topped with solar 
panels. Construction cranes poke into 
the sky as they continue a building boom 
that has been under way ever since Crow 
arrived. Throngs of students thread their 
way around them — ASU has the largest 
undergraduate and graduate enrolment 
of any public university in the country, at 
about 76,000. 

There are a lot of new faculty faces 
as well. Nearly 500 of ASU’s 1,700 or so 
tenure-track faculty have been hired in the 
past ten years — the turnover has largely 
resulted from normal retirements — and 
the university has deliberately sought peo- 
ple who work well with others and look beyond disciplinary walls. 

“I’ve worked at places where we’d have pitched battles over lab space
if room opened up,” says Cheryl Nickerson, a microbiologist at ASU’s 
Biodesign Institute, a cross-disciplinary centre dedicated to understand- 
ing how organisms are built down to the molecular level, and how that 
differs between health and disease. Nickerson, who sends bacteria on 
NASA missions and works with many physicists and engineers, says, 
“Here, I’m not saying we're perfect, but several times I’ve seen people 
give up space to accommodate a colleague with an expanding project.” 

All these changes are part of Crow’s grand vision for reinventing the 
university, and his tireless promotion of that vision has brought him to 
prominence in the world of higher education. He chairs or participates 
in several national committees, including an advisory council on inno- 
vation and entrepreneurship for the US Department of Commerce. And 
he travels the world to lecture at World Bank meetings and other inter- 
national gatherings. Much of what Crow talks about is how ASU has 
focused on replacing narrow academic divisions
with big, bold structures. “Other leaders espouse
this principle of interdisciplinarity, but Crow
has gone the furthest in embracing it, and is the
loudest voice,” says Jerry Jacobs, a sociologist at
the University of Pennsylvania in Philadelphia and author of the book
In Defense of Disciplines (University of Chicago Press, 2013).

Crow’s manner can be blunt and aggressive, says Joshua LaBaer, who
left his position as head of Harvard University’s Institute of Proteomics
in 2009 to work at the Biodesign Institute. But LaBaer says that the
decisions by Crow and his team have generally been sound. “I don't see
the faculty rankling under a loss of power,” he says. “The goals here are
good ones, and you can take advantage of new opportunities.” And of
resources, too: in 2013, the US National Institutes of Health (NIH) gave
ASU researchers some US$48 million, about $22 million of which went
to the Biodesign Institute. By comparison, the university pulled in just
under $20 million from the NIH in 2003.

A substantial share of those resources has helped to build LaBaer’s
unique facility for producing and analysing thousands of proteins,
as part of efforts to understand their function and role in disease. In
secure rooms full of automated machines, human cell cultures churn
out full-length proteins in vials, then robotic arms whisk the molecules
to machines that determine their sequence and structure. What sets
LaBaer’s operation apart is the ability to manufacture and probe thousands
of proteins before they lose their natural folding patterns and
function. The scientists then compare the proteins to see which shapes
and folds are linked to particular diseases.

One priority for the university has been to boost biomedical research
of this type — a tall order for an institution without a medical school. It
has done so in part by forging close ties with the nearby Mayo Clinic in
Scottsdale. That relationship helped ASU to attract LaBaer from Harvard.

There were a lot of worries when Crow and his administrators first
started to reshape the university. In 2005, for example, the anthropology
department was incorporated into a new School of Human Evolution
and Social Change, and anthropologists fretted that their discipline
was going to be diluted into non-existence. But by 2011, according to
anthropologist Alexandra Brewis, the number of faculty members in the
school had risen by 40%, and three-quarters of them were anthropologists.
The other research slots were occupied by applied mathematicians,
epidemiologists, political scientists and human geographers.

In 2010, Brewis and some colleagues surveyed all 54 tenured faculty in the
school to find out who they collaborated with. The strongest partnerships,
they learned, were still between traditional sub-disciplines such as
archaeology and physical anthropology. Many
non-anthropologists in the school often had stronger ties to anthropology
than they did to one another. So diversity within the school
had not led to fragmentation, the researchers concluded, and all the
disciplines were contributing to anthropological research. For example,
a team of researchers is studying the western Mediterranean, an area
that has supported dense populations as well as productive agriculture
for thousands of years. The team is developing computer models that
show how population size, economic behaviour and vegetation change
in the region have affected the sustainability of natural resources, and
how those resources are likely to fare in the future.

ASU’s funding numbers show that grant-givers find the cross-disciplinary
approach attractive. From 2003 to 2012, the university's federally
financed research portfolio grew by 162%, vastly outpacing the average
increase seen at 15 similar public institutions, which were picked for
comparison purposes by ASU’s governing board. And the money that
ASU gets is supporting more interdisciplinary work than ever before.
The number of funded projects with principal
investigators in two or more departments rose
by 75% between 2003 and 2014, whereas projects
led by one department climbed by just 8%.

A similar trend has occurred at Michigan
State University in East Lansing, another institution that has pushed
for greater collaboration between disciplines. Stephen Hsu, the universi- 
ty’s vice-president for research and graduate studies, says that, like ASU, 
Michigan State has seen the value of shared projects. “Due to increased 
specialization, you have experts in specific techniques or types of analysis
scattered among different departments,” he says. “To address many
really big problems, for example, climate change, you need teams with
multiple skills, and therefore must transcend departmental boundaries.”

RAISING ARIZONA

Research at Arizona State University has become more interdisciplinary,
seen here by the increase in projects involving more than one department.
Its scholarly output has risen sharply, but the university has not gained on
peers in several other metrics.

But for all the changes, ASU has had limited success in raising its 
scientific profile relative to its peers, at least in terms of its publication
record. Using Elsevier’s SciVal analysis tools, Nature compared
the publications of ASU researchers to those at some of the same peer 
institutions identified by the university’s governing board. Over the past 
decade, ASU has more than doubled the number of articles it produces 
each year, the biggest percentage rise in its peer group. But because 
everyone increased their production substantially, and because ASU 
started near the bottom, the university moved up only slightly within 
the group. It climbed from fourteenth to twelfth place between 2003
and 2013 (see ‘Raising Arizona’).


MIXED NUMBERS 

Other metrics suggest that ASU researchers are having mixed success 
in generating scholarly impact. The university ranks in the middle of 
its peer group in getting papers into the most cited scientific journals 
and broke into the top five for a couple of years during the past decade. 
Yet it generally comes in last place in producing papers that attract the 
most citations. 

George Raudenbush, ASU’s executive director of research analytics, 
argues that citation data are not the best measure of research quality. 
And he counters that the relative increase in publications is truly dramatic.
It shows that the university has come a long way in a short time,
given that it did not emphasize research as much before Crow’s arrival, 
he says. 

Beyond metrics, there are also questions about how profound the 
organizational changes at ASU really are, and whether they represent a 
major departure in higher education. Few traditional academic depart- 
ments have been eliminated; the university has simply established most 
of the new units on top of them. And most of the faculty members in the 
new schools and groups are actually tenured in traditional departments. 
(SESE is an exception.) 

In fact, some of what ASU has accomplished in terms of promoting 
interdisciplinary research can be seen at other, more staid institutions. 
“Traditional universities have research centres, and that’s where inter- 
disciplinary ideas get addressed,” says Jacobs. When he studied the top 
25 research universities in the United States, he found that they have 
about 100 research centres each, on average. 

But ASU’s administrators maintain that there is something unique 
happening there. By emphasizing new schools and institutes, rather 
than centres within disciplinary departments, the university has built 
conduits among very different specialities that encourage collaboration, 
says Crow. And hiring broad-thinking researchers and pairing them 
with practical technologists — engineers and computer scientists, for 
example — leads the way to addressing broad issues. 

As an example of something the university is doing differently, Crow 
points to its broad-based approach to cancer research. The universi- 
ty’s Center for Convergence of Physical Science and Cancer Biology, 
financed by the National Cancer Institute, brings astrobiologists and 
physicists together with oncologists and evolutionary biologists to 
explore how cancer starts and evolves (see Nature 474, 20–22; 2011).

Some of the centre’s researchers have developed a theory that as a 
cancer spreads, it activates a series of ancient genes that were key to 
the success of the first multicellular organisms (C. Lineweaver, P. C. W. 
Davies, & M. D. Vincent et al. Bioessays 36, 827-835; 2014). The deep 
roots and robust genes might explain why some tumours are so hard 
to get rid of, the researchers propose. The idea implies that cancer is an 
organized response, rather than a series of genetic accidents. 

That line of enquiry, born from an unusual marriage of disciplines,
is unlikely to come from a typical university, says Crow. “We don’t want 
to ask the same questions as other institutions do.”


Josh Fischman is a senior editor at Scientific American. (Nature and 
Scientific American are published by Nature Publishing Group.)

Jie Zhang (right), president of Shanghai Jiao Tong University, awards an honorary doctorate to Peter Salovey, president of Yale University.


Chinese university 
reform in three steps 


High-quality faculty, valued and rewarded, is the key to 
building a world-class research institution, says Jie Zhang. 


China’s economic growth is slowing after 35 years of rapid expansion.
Sustainable development depends on converting that growth into innovation.
So the Chinese government has substantially increased its investment in
universities and research institutes. In 2012, for example, it spent more
than 1 trillion renminbi (US$161 billion) on research and development and
more than 700 billion renminbi on higher education.

As a result, research capacity and productivity have grown. Between 2005
and 2012, the number of full-time-equivalent researchers in China increased
by 38% (to 314,000). Over the same period, the number of published research
articles from Chinese higher-education institutions rose by 54%
(to 1,117,742) and granted patents went up eightfold (to 66,755).

Yet the quality of research, as indicated 
by citations, lags behind, and technology 
transfer is sluggish. Ossified practices in
evaluation and incentivization, such as
rewarding publication quantity over quality,
are holding Chinese universities back.

As president of a Chinese research uni- 
versity, I believe that building a high-quality 
faculty is the key to developing a world-class 
research university. In the past ten years, my 
institution, Shanghai Jiao Tong University 
(SJTU), has created a culture of innova- 
tion and boosted research capacity through 
reforms to career paths for existing and new 
faculty members. 

A robot developed by the School of Mechanical Engineering at Shanghai Jiao Tong University in China.

China’s leading universities are taking steps
to enhance their research productivity.
Setting up new institutes, hiring high-profile
overseas scholars at high salaries and
introducing tenure systems are proven short- 
cuts to excellence. But importing a handful of 
‘star’ professors is not sufficient to change uni- 
versity culture. Indeed, they can make local 
faculty members feel overlooked. Instead, 
fundamental reforms are needed. 

My experience at SJTU presents a good 
worked example. We have implemented a 
university-wide incentive system to motivate 
all faculty, staff and students. The goal is to 
develop a high-quality faculty comparable to 
that of the best Western universities by 2020. 

The changes began in 2007 with an 
analysis of future national and global chal- 
lenges, open discussions among faculty 
and staff, and a feasibility review. Priori- 
ties emerged, including engaging creative 
minds, incubating innovations and bridg- 
ing the sciences and humanities to best 
serve the nation and the world. To maintain 
faculty support, fundamental reforms were 
adopted in stages over several years. 


First: recruit and mentor junior staff. 
World-class scientists were hired to build 
research groups focused on cutting-edge 
science and engineering problems and to 
set a high bar for academic performance. 
Junior faculty members were recruited 
competitively in the international job mar- 
ket. Generous start-up funds supported 
the groups for the first few years, until they 
could attract funds from Chinese govern- 
ment agencies or industries, start to produce 
results and establish their reputations. 

A tenure system for new faculty members 
began in 2007. A six-year tenure track was 
set up for new junior faculty based on mid- 
term and final evaluation by an international 
committee. 


In 2008, we launched a mentoring system 
for junior faculty under 35 years old. And 
since then, 1,251 junior faculty members 
have won extra support for their research, 
housing and living costs from a 150-million- 
renminbi endowment. 


Second: three career tracks for faculty. 
Promotion and salary are assessed every three 
years against performance indicators devel- 
oped by each school or department. Three 
career tracks — teaching, research and tenure 
— were created in 2010 for existing faculty. 
The teaching track has no research expecta- 
tion; scientists with no teaching obligation 
must make up part of their income from com- 
petitive research grants. Tenure-track profes- 
sors teach and do research and are evaluated 
according to combined criteria. 

Each track has similar starting salary 
scales. Faculty members chose their own 
track, with the option to switch later if peer 
evaluations allow. All faculty may apply 
for transfer to the tenure-track system as 
standards rise; the best will be encouraged 
to do so. The average salary for faculty has 
increased by 60% in the past four years, and 
will rise further in the next four years. 

The university decentralized its governance 
to give schools and departments more auton- 
omy to recruit, develop and evaluate their 
staff. Budget reforms have given each school 
or department direct access to resources. 


Third: one merged tenure system. The infu- 
sion of fresh academic blood gave the univer- 
sity a split nature: international scholars on the 
Western tenure path; existing faculty follow- 
ing the three tracks. In 2013, six pilot schools 
or departments began to merge the two paths 
into a single tenure system similar to that of 
North American universities. Encouraging
results have already been seen in physics,
mathematics, mechanical engineering, bio- 
medical engineering, law and management. 
On the basis of these experiences, the 
entire university will switch to the tenure 
system between 2015 and 2018. Although it 
is impossible to suddenly double or triple the 
salary scale for all 3,000 faculty members, 
mechanisms for paying competitive salaries 
within the tenure system are being built, 
such as setting up chairs and fellowships 
from a 500-million-renminbi endowment. 
Faculty members who do not qualify 
for the tenure track can choose to leave or 
remain in the contract system until it is 
phased out by 2018. It is important that fac- 
ulty who have served the university well but 
cannot meet the new standards be treated 
fairly and with respect and provided with a 
channel to continue to serve the institution. 


PROVEN RESULTS 

Today, SJTU is well on its way to being one 
of the leading research universities in the 
world. Since 2007, 450 world-class profes- 
sors and top-tier young faculty members of 
international standing have joined the uni- 
versity, and more than 250 existing faculty 
members have transferred to the tenure- 
track system. Eighty-five per cent of faculty 
hold PhDs (up from 50% in 2006). 

The university’s annual revenue has more 
than doubled since 2007, to more than 
7 billion renminbi; competitive research 
income has tripled to more than 2.5 billion 
renminbi. The number of disciplines offered 
at SJTU that are ranked in the global top 1% of 
Thomson Reuters’ Essential Science Indicators
(ESI) jumped from 5 in 2007 to 16 in 2014.
Social sciences joined engineering, and natu- 
ral, life and medical sciences in the top 1%, 
making the university more comprehensive. 

In terms of number of papers published, 
the university has ranked second globally in 
engineering since 2007. ESI ranked natural 
sciences 40 in 2013, up from 57 in 2007. Life 
sciences and agriculture rose to 43 from 136, 
and medicine is now ranked 27, up from 166. 
Citations are up and more discoveries are 
being patented. 

The reforms at SJTU have promoted a 
shift in educational emphasis. We are mov- 
ing away from knowledge transfer to knowl- 
edge creation and from instruction-centred 
teaching to student-centred learning. Our 
philosophy has changed to nurturing stu- 
dents to be engaged, competent global 
citizens. A culture that values and rewards 
innovation has successfully taken root.


Jie Zhang is the president of Shanghai Jiao 
Tong University in Shanghai, China. He is 
an academician of the Chinese Academy 
of Sciences, and a foreign associate of
the US National Academy of Sciences.

e-mail: jzhang1@sjtu.edu.cn


Companies
on campus 


Housing industry labs in academic settings benefits all 
parties, say Jana J. Watson-Capps and Thomas R. Cech. 


Pete Mariner works up the hall from
his PhD adviser and one floor down
from his postdoc adviser, but he 
does not work in academia. He is a senior 
scientist at Mosaic Biosciences, a start-up 
developing synthetic materials to help 
wounds heal faster, yet his labs are in the 
University of Colorado Boulder. They are 
part of the university’s BioFrontiers Insti- 
tute, an interdisciplinary effort to tackle 
complex biology and forge connections 
with companies. 
Over the past three decades, academia 
and industry have been converging philosophically
and physically. Thirty-four years ago, the
Bayh–Dole Act encouraged US
academics to patent their discoveries, work
with companies and become entrepreneurs.
Policies in Europe have moved in similar
directions. Companies increasingly partner
with university scientists to enhance their
research. In a 2007 survey of life-sciences
faculty members from the 50 US universities
that receive the most financial support
from the US National Institutes of Health, just
over half of the respondents reported having
some relationship with industry.

Successful academia–industry partnerships
require common interests, trust and
good communication. For each of these, 
proximity helps. 

Many universities have off-campus 
research parks, but some academic research 
facilities have gone a step further and 
brought small companies within their own 
walls. BioFrontiers (of which J.J.W.-C. is 
associate director, and T.R.C. is direc- 
tor) is one of the youngest experiments 
in ‘co-location’. More are set to open soon
(see ‘Within the same walls’). When it is 
done well, all parties benefit. 


BUILDING BUDDIES 

Various university offices connect faculty 
members, students and companies through 
technology transfer, industrial partner- 
ships, student internships and mentoring. 
But these centralized resources do not allow 
for the spontaneous interactions that can 
arise from shared excitement about solv- 
ing a problem. Co-location removes the 
physical separation and the intermediaries 
between researchers in academia and those 
in industry, and so allows serendipitous 
relationships to bloom. 

Faculty members benefit from the influx 
of corporate expertise. Researchers with
industrial experience are often more 
knowledgeable about high-throughput 
technology and commercial applications 
than their academic counterparts. Our 
biomedical faculty members tell us that 
they value industry collaborations as a way 
to apply discoveries in ways that eventually 
benefit patients. Students gain real-world 
experience and opportunities to work at 
these companies as they expand. Young 
companies benefit from access to flexible 
lab space, core facilities, an invigorating 
research environment and an educated 
workforce. 

For example, when start-up Archer Dx, 
based in Boulder, began developing next- 
generation sequencing kits and software to 
research cancer treatments, it kept capital 
expenditures down by renting pre-built lab 
space at BioFrontiers and buying services 
from the university's genomics facility. When 
the company was purchased by a larger diagnostics
and reagents company (Enzymatics,
headquartered in Beverly, Massachusetts) 
and moved to a larger space off campus, it 
hired several former students. 

Another example of co-location is 
the California Institute for Quantitative 
Biosciences (QB3). This supports two 
on-campus incubators for University of 
California spin-out companies, called ‘bio- 
tech garages’ in homage to the early Silicon 
Valley tech start-ups. One QB3 start-up 
is Caribou Biosciences, founded on
genome-engineering technology from
Jennifer Doudna’s lab at the University of 
California, Berkeley. Following a now- 
familiar pattern, Caribou began operations 
in the Garage@Berkeley — steps from the 
Doudna lab — before moving into a larger 
space as the company grew. 

HudsonAlpha Institute for Biotechnol- 
ogy, a non-profit organization in Hunts- 
ville, Alabama, brings together principal 
investigators, postdocs and some students 
alongside core facilities and independ- 
ent companies that are developing new 
genomic technologies. Thermo Fisher
Scientific, a global biotech company based
in Waltham, Massachusetts, bought one 
of the institute’s start-ups in 2008, and 
retains its operations in Huntsville, citing 
the importance of proximity to researchers 
outside their own expertise. 


RULES OF ENGAGEMENT 

Co-location has challenges. Universities 
are among the last places to prize research 
for the sake of pure discovery. All co- 
location leaders, business representatives, 
university administrators and develop- 
ment officers must help to implement the 
goals of the programme while protecting 
blue-sky research. 

Ideally, co-location should be financed 
with funds that would not normally go to 
basic research, such as rent from tenant 
companies, philanthropic donations aimed 
at entrepreneurship and targeted grants. 
We have furnished several core facilities 
serving both academics and local compa- 
nies using infrastructure grants from Colo- 
rado’s Office of Economic Development 
and International Trade. HudsonAlpha was 
founded and largely funded by scientist- 
entrepreneurs Jim Hudson and Lonnie 
MacMillan, specifically to house academic 
faculty members alongside small companies. 
A*STAR (Agency for Science, Technology 
and Research) in Singapore is funded mainly 
by government programmes to boost com- 
mercial research and development. 

Nonetheless, universities need to devote 
resources to addressing real and perceived 
conflicts of interest. This requires careful 
policies on intellectual property, use of uni- 
versity resources, faculty time and conflicts 
of interest. For example, students cannot 
be graded and employed part-time by the 
same person. On-campus companies should 
explicitly ensure participating students’ abil- 
ity to publish in a timely fashion, a practice 
already established for sponsored research 
agreements. 

Companies predisposed to open science 
might be attracted to co-location. Accommo- 
dating these companies on campus demands 
flexibility and clarity. Just as universities 
need to be up-front about their goals and 
expectations, they also need mechanisms to 


remove participants who might be better off 
in more conventional settings. For example, 
we have offered leases on lab space as short 
as six months, which can be renewed. In the 
future, lease renewals at BioFrontiers might 
also depend on how companies interact with 
academic neighbours, for example through 
mentoring students. 

Letting space to companies puts universi- 
ties in the sometimes-awkward position of 
a landlord who needs to evaluate whether 
potential tenants can fulfil their rental pay- 
ments and other obligations. Already, we 
have had a very young company leave a lab 
space after less than a month because antici- 
pated seed funding did not come through. 


COOKIE HOUR 

Customs and architecture should stimulate
interactions. In the BioFrontiers building,
academic and company researchers
share a café and common spaces. Labs and
offices are arranged so that people must
pass through a main corridor to get from
one to another, encouraging hallway
conversations. Each week, a company or
academic lab hosts a ‘cookie hour’ for
anyone in the building. There are also
whiteboards in hallways, where a spontaneous
interaction can quickly turn into an idea sketch.

Co-location will be most successful
in academic settings that explicitly
value entrepreneurship and translational
research activities (for example, when
recruiting faculty members or evaluating
them for promotion and tenure), and
where resources are available to foster
community and to support a leadership
team to oversee the programme. Emerging
companies will be more likely to take
advantage of co-location opportunities if
there are grants and seed funds available 
to subsidize their rent, if core facilities are 
available and if research collaborations 
with the university are easy to set up. 

Fundamentally, a university must view 
companies as partners in its research and 
education mission, not simply as an alterna- 
tive revenue source. 


UNIVERSITY ECOSYSTEM 

We believe that the daily interaction between 
education, research and enterprise resulting 
from co-location will connect universities 
to their communities and make them more 
relevant to students and parents paying 
tuition fees. Co-location sites will become 
magnets for entrepreneurial faculty mem- 
bers, postdocs and students, as well as for 
companies looking to hire new talent. 

The intersection of academia and indus- 
try will become more natural as faculty 
members look for more ways to make their 
discoveries relevant, as students want more 
value for their degrees, and as companies 
want more input into developing their work- 
force. Industrial inhabitants will be part of 
the future university ecosystem.


Jana J. Watson-Capps is associate director 
of the BioFrontiers Institute at the University 
of Colorado Boulder, USA. Thomas R. 
Cech is professor of chemistry and 
biochemistry at the University of Colorado 
Boulder and director of the BioFrontiers 
Institute. 

e-mail: jana.watson-capps@colorado.edu

in January 1995, during an Ebola outbreak in Zaire that killed 254 people.


Ebola: learn from the past 


Drawing on his experiences in previous outbreaks, David L. Heymann calls for 
rapid diagnosis, patient isolation, community engagement and clinical trials. 


The Ebola outbreak identified in Guinea
in March this year has spread to thousands
of people across West Africa.

Now, in crowded urban areas, transmission 
has accelerated. The number of confirmed 
and suspected Ebola cases reported in the 
week preceding 8 October (854) is more than 
twice the total number of cases confirmed in 
the largest previous outbreak, which lasted 
around four months, beginning in late 2000. 

Past outbreaks of Ebola were stopped 
while they were still in rural areas. Popula- 
tion density was lower, community ties were 
stronger and, arguably, measures to prevent 
transmission were easier to implement. 

Yet the lessons that I have learned in rural 
Africa since participating in the investiga- 
tion of the world’s first recorded outbreak 
of Ebola in 1976 still apply. People with 
fever and recent contact with an infected 
person must be diagnosed as soon as pos- 
sible, and those with Ebola must receive care 
in isolation wards. Communities need the 
knowledge and the means to prevent trans- 
mission, including safe ways to transport 
infected people to isolation wards and to 
handle dead bodies respectfully. 

The current outbreak demands all this 
and more. There is no proven treatment for 


Ebola. Amid distrust and battered health-care 
systems, affected countries and international 
workers are hoping to launch clinical trials. 
This outbreak will go on for months yet, and 
it will not be the last. It would be an injustice 
not to learn which of several experimental 
treatments can be used to save lives. 


HISTORY LESSONS 
I was a member of the team that investigated 
the first known outbreak of Ebola. It was in
the Democratic Republic of Congo (DRC, 
then called Zaire) at the Yambuku Mission 
Hospital nearly four decades ago. By tracing 
contacts and dates of infection, we searched 
for the first person infected (possibly from 
the blood of a game animal butchered for 
food) who became a source of infection for 
others. We concluded that this ‘index patient’ 
had been treated at the mission clinic for a 
nosebleed, and for dysentery with an injec- 
tion. His visit was noted in an unremarkable 
entry on line 2,355 of an outpatient ledger. 
The needles and syringes used at the hos- 
pital were shared with the maternity ward. 
Equipment was, at best, rinsed with distilled 
water between patients. Outpatients and 
pregnant women were infected by injections; 
health workers were infected by blood and
bodily fluids from patients, and workers in
turn infected family and community mem- 
bers. The virus spread from that one index 
patient to 318 people, resulting in 280 deaths. 
The outbreak ended spontaneously, ironically 
hastened because the hospital closed after 
workers became infected or fled their posts. 

Ebola flared the next year at the Tandala 
Mission Hospital in the north of the DRC, 
about 250 kilometres from Yambuku. I was 
based in Cameroon as an epidemiologist for 
the US Centers for Disease Control and Pre- 
vention. A colleague and I drove for two days 
across Cameroon and the Central African
Republic on unpaved roads through tropical
rainforest to investigate. 

Although the index case in that instance 
had also received care at a poorly equipped 
hospital, a major outbreak did not occur.
The physician in charge, a participant in the 
investigation at Yambuku, had suspected 
Ebola and isolated the patient, a nine-year- 
old girl. Only one more infection occurred 
— in the girl’s younger sister — and blood 
tests of hospital workers and the patient's 
contacts found that the physician himself 
was carrying an antibody to Ebola, probably 
from a previous infection. 

These early investigations revealed
patterns. The first sign is often a cluster of
people with diarrhoea or fever, lethargy and 
other symptoms sometimes confused with 
typhoid fever. Ebola emerges in rural settings; 
transmission occurs by contact with infected 
people’s blood and body fluids. Its spread is
amplified by poor hospital practices (such 
as re-using needles) — health workers are at 
great risk of becoming infected and spread- 
ing the virus within hospitals and into their 
communities. Isolation can prevent hospital 
transmission, assuming that infection con- 
trols such as protective equipment and safe 
disposal practices are in place. 

By 1995, when virologist Jean-Jacques 
Muyembe and I led the response to the Ebola 
outbreak at Kikwit General Hospital in the 
DRC, we knew that rapid and robust action
could stop spread, even along major transport 
links. (Kikwit is 350 kilometres, or 5 hours by
road, from the capital Kinshasa; one patient
travelled there and was rapidly identified
and isolated.) We also learned that communities
clearly understood the risk of infection,
and could be persuaded to forgo dangerous
funeral rites, such as washing out deceased 
relatives’ mouths or clipping their fingernails. 


SUCCESSFUL STRATEGIES 

We had a three-pronged strategy. First, 
patients were identified and isolated, and 
protective clothing was provided to health 
workers. Second, contacts of all patients 
with Ebola were monitored, and their tem- 
perature taken twice a day for three weeks. 
Those with fever were isolated until diagno- 
sis could be confirmed and those with Ebola 
were hospitalized. Third, individuals were 
educated to protect themselves and their 
families. In this and several other outbreaks, 
organizations such as the Red Cross and the 
Red Crescent societies worked with village 
elders and chiefs to distribute information 
tailored to local traditions. Red Cross work- 
ers in protective gear provided transport for 
patients and burial services for the dead. 
When the hospitals were full in Kikwit, some 
patients were isolated in their homes. Their 
families were provided with protective cloth- 
ing and monitored daily. 

Other strategies have been less successful.
Attempts to block the disease at Africa’s
porous borders did not stop past outbreaks, 
and will not work now. A cordon sanitaire was 
established by the DRC government around 
Kikwit in 1995 and enforced through military 
roadblocks. But contacts under fever surveil- 
lance travelled outside the cordon in dugout 
canoes. The military patrolled roads, but not 
forest paths leading to the Kwilu River. 

Fast-forward to 2014. The quarantine of 
an urban slum of Monrovia, Liberia's capi- 
tal, was lifted just days after its declaration; 
it had led to armed clashes, even as residents 
reportedly moved in and out by avoiding 
checkpoints or bribing their way through.


Measures that were successful in stopping 
more than 20 major Ebola outbreaks would 
have probably worked in March in rural 
Guinea. These approaches are helping to 
contain another Ebola outbreak that began 
in March this year in the DRC. Now, they 
must be adapted for the more complex, dif- 
ficult situation in urban, mobile societies. For 
example, city wards may need to examine all 
fever cases daily, particularly where contact
tracing is ineffective.


TOUGH TRIALS 

The need for standard fluid replacement in 
patients is now urgent. Rich countries have 
developed a few experimental vaccines and 
therapies, spurred in part by fears of bioter- 
rorism. In August, the World Health Organi- 
zation (WHO) reached a global consensus 
that it would be ethical for clinical trials of 
these medicines to be carried out in affected 
countries, even if the standard regulatory
demonstrations of safety and efficacy
were not yet complete. Indeed, rigorous clinical
trials for such treatments are possible only
during outbreaks, and are the only way to 
learn what is effective. 

In September, the Wellcome Trust, a bio- 
medical-research charity in London, pledged 
£3.2 million (US$5.2 million) to sponsor 
clinical trials. This is a fraction of the fund- 
ing needed, and other funders such as the Bill 
and Melinda Gates Foundation are joining 
in. Even with recent undertakings from the 
international community to send equipment 
and specialized staff, and to train thousands 
of health workers, conducting clinical trials 
will be difficult. The outbreak has closed or 
overwhelmed health-care facilities, where 
infected patients have been turned away from 
hospitals to die at home or in the streets. 

Nonetheless, the Wellcome Trust and 
others are assessing sites for clinical trials, 
and preparing potential protocols. They are 
likewise discussing ethical issues such as 
whether experimental treatments should be 
made available only in the context of rand- 
omized controlled trials, or through other 
study methodologies that would permit more 
widespread use. 

The number of infected patients far out- 
strips the limited supplies of candidate vac- 
cines, antivirals and other medicines. Again, 
experience from history may prove useful. 
For ten weeks after the Yambuku outbreak 
in 1976, I stayed on in a ‘plasmapheresis’ 
programme. Thirteen Ebola survivors vol- 
untarily supplied their blood plasma, which 
contained proteins that neutralize the Ebola 
virus. One of the plasma units was given to 
a laboratory technician accidentally infected 
in the United Kingdom; he survived. Almost
20 years later, during the Kikwit outbreak in 
1995, eight patients received transfusions of
whole blood from survivors; seven lived.

Whether these treatments work cannot 
be determined from this ad hoc adminis- 
tration of survivor-blood products. Some 
scientists doubt that they will work, suggest- 
ing that immune serum has been shown not 
to decrease the amount of Ebola virus in the 
blood. But after reviewing the experimental 
treatments available, an expert panel con- 
vened by the WHO recommended clinical 
trials using survivors’ blood, as did the WHO 
Blood Regulators Network. Compared with 
other experimental treatments, plasmapher- 
esis requires a complicated procedure and use 
of a potentially infectious fluid. But even some 
rural clinics in sub-Saharan Africa regularly 
collect blood (the first step in plasmapheresis) 
for transfusions, and screen out donors who 
carry HIV or hepatitis. 

Unlike the other experimental products, 
survivor serum would not be in short supply. 
As results become available, trials could 
transition into treatment if effectiveness is 
demonstrated (and health-care facilities re- 
established). This could encourage patients 
and their families to agree to hospitalization, 
and so help to isolate infected patients and 
stop the spread of infection. For instance, 
hundreds of survivors in Sierra Leone, which 
has so far seen around 2,700 cases of Ebola 
and about 880 deaths, could supply a plasmapheresis
trial enrolling large numbers of
patients. In the past, survivors have often 
been willing to provide blood to help others 
— that will hopefully be the case again. 


AFTER THE HEADLINES 
Research and development for vaccines and 
treatments must continue once this terrible 
outbreak has passed. Eventually, as for all 
emerging infections, attention must shift to 
prevention at the source — keeping the Ebola 
virus from breaching the species barrier or 
using a future vaccine to prevent human 
infection. Once transmission is fully under- 
stood, communities can better prevent it. 
For now, full international support 
must be focused on stopping this outbreak 
through innovative and intensified patient 
isolation, contact tracing and community 
empowerment. 


David L. Heymann is professor of infectious 
disease epidemiology at the London School 
of Hygiene and Tropical Medicine, UK, and 
head of the Centre on Global Health Security 
at Chatham House, London, UK. 

e-mail: david.heymann@lshtm.ac.uk
Deep-sea hydrothermal vents may have provided the conditions for the origins of life.


The first spark


David Deamer welcomes a synthesis of what we know 
about the origins of life, as told by a master in the field. 


Franklin Harold’s In Search of Cell
History is a wonderful book. Harold
has for 60 years been an intelligent
and clear-minded researcher and observer
in the fields of cell and molecular biology.
His book is a loving distillation of connections
within the incredible diversity of life
in the biosphere, framing one of biology’s
most important remaining questions: how
did life begin?
This is also a personal account. Here is
Harold musing after washing the dishes: “I
look upon my work and see that it is good,
and I have no doubt that the same need to find
order in the universe motivates much of science.”
Using this deceptively casual approach,
he cleans up the vast untidy mess of biology
and stacks the fundamental concepts in an
orderly and creative way for readers to enjoy.

The content of each chapter can be found 
in any good undergraduate biology text, but 
Harold fits the information into a larger con- 
text, often in unexpected ways. For instance, 
he discusses geochemist Michael Russell’s 
idea that physical processes in hydrother- 
mal vents could produce proton gradients, 
in which one side of a barrier membrane is
acidic, the other alkaline. The movement 
of protons across these gradients supplies 
energy to all life now, and perhaps did so 
even in the first primitive life. Harold also 
reveals how much biologists can learn from 
geologists about the history of life on Earth. 
For instance, liquid water appeared on Earth 
more than 4 billion years ago; half a billion 
years later, the first known microbes (now 
fossilized in Australian rock) appeared. 

I do have a quibble. Harold argues that, 
notwithstanding the vast literature, progress 
has gone little beyond the findings of Soviet 
biochemist Alexander Oparin and British 
polymath J. B. S. Haldane more than 80 years 
ago, when they independently argued that 
Louis Pasteur’s dictum ‘All life from life’ was 
wrong. Oparin and Haldane theorized that 
life may have emerged on a sterile prebiotic 
Earth through a series of chemical and phys- 
ical processes. 

I confess to being more optimistic than 
Harold. There has been extraordinary pro- 
gress in understanding the principles by 
which life works at the molecular level, and 
that can be applied to the question of how 
life begins. Over the past eight decades, it has 
become clear that the basic molecules of life 
can be synthesized through well-understood 
chemical reactions. The Strecker synthesis, 
for instance, produced amino acids from 
methane, ammonia, hydrogen and water 
vapour in Stanley Miller’s famous 1950s 
experiment testing the Oparin–Haldane
hypothesis. Furthermore, amino acids, 
nucleobases and lipid-like molecules — 
the building blocks of life — are present in 
carbon-containing meteorites. That makes 
it entirely plausible that similar organic 
compounds were available on the prebiotic 
Earth, waiting to be caught up in whatever 
process led to life’s beginning. 

In Search of Cell History: The Evolution of Life’s Building Blocks
FRANKLIN M. HAROLD
University of Chicago Press: 2014.

There is more. In the 1960s, biophysicist
Alec Bangham discovered that phospholipids
assemble into cell-sized compartments
(liposomes), and chemist Leslie Orgel found
that chemically activated nucleotides (the
organic molecular subunits of nucleic acids)
spontaneously combine, or polymerize,
into short strands of RNA. We now
understand how light energy is captured by
green plants, that the molecule adenosine
triphosphate (ATP) is the energy currency
of all life and that enzymes such as
polymerases use that energy to catalyse the
polymerization of amino acids into proteins,
and of nucleotides into nucleic
acids. The molecular foundation of evolution
became clear when DNA’s structure and
function were established by Francis Crick
and James Watson in the 1950s and 1960s.
Finally, we know how to encapsulate all those 
reactions in lipid compartments that mimic 
cell membranes, and several pioneering 
laboratories are taking the first steps towards 
fabricating microscopic systems of molecules 
that display the fundamental properties of life. 
Harold writes about these topics, so it 
seems that we have made considerable pro- 
gress after all. If we use a jigsaw puzzle as a 
metaphor, more than 80 years ago we opened 
the box and found hundreds of loose pieces; 
today, some of them have been correctly 
placed around the edges of the puzzle. We 
still cannot see the picture in the centre, but 
I am satisfied that we have the framework.
Thousands of young biologists work 
mostly on the narrowly defined problems 
that are the crux of successful grantsman- 
ship. Harold’s book is like a balloon that will 
let them rise above the trees for a while and 
look down to better understand the scope 
and shape of the forest, and perhaps
then descend to pluck some low-hanging
fruit. Senior scientists like myself will take
pleasure in comparing perspectives with
Harold’s. This is, after all, a story to conjure
with: that of how life began and evolved
into eukaryotic cells, a hundred trillion of
which compose the human body. No one can
yet tell this story in its entirety, but Harold’s
book is a good place to start.

A depiction of an animal cell.


David Deamer is chair of the department of 
biomolecular engineering at the University 
of California, Santa Cruz. His research 
concentrates on nanopore sequencing of 
nucleic acids. His most recent books are
First Life and, co-edited with Jack Szostak,
Origins of Life. 

e-mail: dwdeamer@ucsc.edu 


Books in brief 


Life’s Blueprint: The Science and Art of Embryo Creation 

Benny Shilo YALE UNIVERSITY PRESS (2014) 

The extraordinary ‘shape-shifting’ of the developing embryo marks 
embryology as one of the most visually arresting studies in science. 
Fittingly, evocative images pack geneticist and photographer Benny 
Shilo’s concise tour of the field’s evolution over the past 30 years. 
Shilo juxtaposes scientific photographs with his own stunning 
shots, chosen to elucidate the findings metaphorically. So a spiral 
staircase and its shadow against the sunlit side of a building echo 
the complementarity of DNA structure, while a relief carving in stone 
illustrates how cells are selectively killed to shape digits. 


How We Got to Now: Six Innovations that Made the Modern World 
Steven Johnson RIVERHEAD (2014) 

In this nimble history of invention, science writer Steven Johnson 
reframes ubiquity by focusing on six unglamorous innovations that 
triggered vast social transformation — from water purification to 
electric lighting. He uses a “long zoom” approach to history, tracing 
change on scales from the atomic to the planetary, to reveal how 
the impacts of innovation can be unexpected, for good or ill. From 
the sanitation engineering that literally raised nineteenth-century 
Chicago to the 23 men who partially invented the light bulb before 
Thomas Edison, this is a many-layered delight. 


Dr. Mütter’s Marvels: A True Tale of Intrigue and Innovation at the
Dawn of Modern Medicine

Cristin O’Keefe Aptowicz GOTHAM (2014)

Fused skeletons, grossly enlarged colons and other pathological
curiosities crowd the Mütter Museum in Philadelphia, Pennsylvania.
But it is the collector, nineteenth-century surgeon Thomas Mütter,
who stars in this beautifully detailed biography by writer Cristin
O’Keefe Aptowicz. Mütter started out as a foppish medical student in
Paris, but ended a hero tending to the injured poor of Chicago. What
emerges here is a dual portrait of the driven doctor and a medical 
field transformed by scientific, if sometimes eccentric, pioneers. 


Claxton: Field Notes from a Small Planet 

Mark Cocker JONATHAN CAPE (2014) 

Naturalist Mark Cocker last astonished us with a global survey of avian 
and human interaction, Birds and People (Jonathan Cape, 2013; see
Nature 500, 25; 2013). Now he homes in on the local for this lovingly 
edited assemblage of 140 previously published pieces chronicling a 
‘year in the (wild) life’ of Claxton in East Anglia, UK. Cocker is a quietly 
eloquent guide to this landscape teeming with species from mouse 
moth to wych elm — describing, for instance, how wigeons “peel off 
the water as a continuous blanket that instantly atomises and falls 
back to earth amid a downpour of contact notes”. 


Molecules: The Elements and the Architecture of Everything 
Theodore Gray BLACK DOG AND LEVENTHAL (2014) 

This big, lush chemical romance of a coffee-table book showcases 
photographs of compounds and materials as if they were Bulgari 
jewels on black velvet. Theodore Gray, whose 2009 book and app 
The Elements (Black Dog and Leventhal) remains a huge best-seller, 
here canters through atomic and molecular structure and bonds; 
organic and inorganic chemicals; and materials. Gray’s wit and 
scientific nous blaze as he unpacks the mechanics of soap, the 
sulphur compounds in essence of skunk and more. Barbara Kiser 
Dorothy Hodgkin (left) supervised future British Prime Minister Margaret Thatcher (Roberts at the time) in her undergraduate chemistry studies at Oxford.


When Hodgkin met Thatcher 


Jessa Gamble on a radio play about the Nobel laureate and the UK prime minister. 


Fifty years ago this month, crystallographer
Dorothy Crowfoot Hodgkin
became the only British woman to win
a Nobel prize in science. To mark the occasion,
Adam Ganz’s radio play The Chemistry
Between Them imagines an overlooked
chapter of Hodgkin’s storied life: the period
during the 1940s when she mentored a
singular undergraduate in her lab at the
University of Oxford’s Somerville College.
That undergraduate was the only scientist,
and the only woman, to become prime minister
of Britain: Margaret Thatcher.
Ganz’s play centres on a 1983 meeting
between Hodgkin and Thatcher at the prime
minister’s country home, Chequers, intercut
with scenes in the lab. Ganz contrasts the two
women’s politics, and traces the evolution of
Thatcher’s character under Hodgkin’s tutelage.
He drew on Georgina Ferry’s Dorothy
Hodgkin: A Life (Granta, 1998), Hodgkin’s
own writings and interviews with some who
knew her, such as the chemist Judith Howard.
The Chemistry Between Them opens with
a nervous Margaret Roberts, as she then was,
newly arrived at Oxford from a life above her
father’s shop. Hodgkin, 15 years her senior,
begins to exert a subtle influence, introducing
cosmopolitan and feminist ideas into
the younger woman’s conservative Methodist
mind. In their scientific discourse
on structural stability and the antibiotic
gramicidin B, Hodgkin nudges Roberts to
abandon her black-and-white mindset in
favour of more nuanced thinking.


The Chemistry Between Them
ADAM GANZ
Broadcast in August 2014.

Then we jump forward 40 years to the
height of the cold war. Hodgkin is president
of the Pugwash conferences on science and
peace, trying to sway Thatcher, now prime
minister, towards nuclear disarmament. 
Their politics were polarized. Hodgkin was 
awarded the Soviet Union's Lenin Peace Prize 
for furthering understanding between East 
and West, and was banned from entering the 
United States — except by a Central Intelli- 
gence Agency waiver — because her husband 
was a member of the UK Communist Party. 
Like many scientists of the 1930s and 1940s, 
she saw communism as the only system likely 
to fund science with an eye to the long view. 
Basic research seemed too easily dismissed as 
inefficient by free marketeers — an attitude 
that had its heyday during Thatcher's tenure, 
despite her scientific training. 

A puzzling flaw in the play is the assump- 
tion that there was any friendship between 
the two. Thatcher kept a picture of the 
crystallographer at 10 Downing Street in 
London, but the sentiment was not so fondly 
returned, according to family members. As 
an academic and a lobbyist, Hodgkin's visits 
to Chequers were explicitly political, and she 
was openly critical of Thatcher. 

Ganz has explored the interplay between 
science, politics and morality before. His 
2010 work Nuclear Reactions used tran- 
scripts from Farm Hall, a bugged English 
country house where captured German
nuclear physicists were interned, and their 
conversations recorded, after the Second 
World War (see A. Finkbeiner Nature 503, 
466-467 (2013) for another take on these 
events). 

Very different dynamics, politics and 
ethics pervade The Chemistry Between 
Them. Beyond its portrayal of two towering 
women, it depicts the radicalism of scientists 
of Hodgkin’s generation, and their decency 
and morality. Crystallographer J. D. Bernal, 
Hodgkin's PhD supervisor, serves as a stand- 
in for the unconventional thinkers with 
whom she mixed, from the philosopher and 
anti-nuclear activist Bertrand Russell to the 
political theorist Isaiah Berlin. Matter-of- 
fact discussions of contraception and mari- 
tal infidelity mark the group as radical for 
its time. 

Through her willingness to question con- 
sensus, Hodgkin became an exceptionally 
creative and powerful scientist. This play 
does not enhance what we know of her life, 
nor does the dialogue ring true for those who 
knew her. Where it does succeed is in showing
that for her and many others, science was
an integral part of moral and political life.


Jessa Gamble is a writer based in 
Yellowknife, Canada, and author of
The Siesta and the Midnight Sun. Her
grandmother was Dorothy Hodgkin’s first
cousin. 

e-mail: jessa_sinclair@yahoo.com 
Pakistan must invest
in adaptation 


Floods in Pakistan this year 
alone have killed hundreds of 
people, left millions homeless 
and destroyed crops over tens 
of thousands of hectares. In its 
Global Climate Risk Index 2014, 
the think tank Germanwatch 
ranked Pakistan third in its list 
of countries most affected by 
climate change, after Haiti and 
the Philippines. 

Yet Pakistan's climate- 
change budget for 2013-14 was 
44% lower than the previous 
year’s. Furthermore, the federal 
government has largely devolved 
responsibility for environmental 
issues to the provinces, which 
cannot or will not commit 
resources to climate-change 
policies. 

It is important that 
the principles of disaster 
management are simplified so 
that the public can understand 
them and question government 
responses where necessary. 
Many citizens already realize 
that towns are being flooded 
as a result of illegal building on 
neighbouring floodplains and 
waterways. 

Diplomacy in India and 
Pakistan has secured reciprocal 
arrangements for flood relief, 
but this is not enough. Rainfall 
data need to be coordinated 
and exchanged between the 
two countries to improve 
flood forecasting and disaster- 
management governance through 
organizations such as the South 
Asian Association for Regional 
Cooperation. 

Pakistan and most other 
developing countries have little 
influence on actions determined 
by Western countries to reduce 
carbon emissions. The best 
option for developing nations is to 
offset the negative effects of rising 
temperatures and extreme events 
by developing weather-tolerant 
crops and housing, by planning 
for effective land use, and by 
improving energy efficiency. 
Abdur Rehman Cheema 
COMSATS Institute of
Information Technology,
Islamabad, Pakistan. 
arehmancheema@gmail.com 


Keep files small to 
curb energy use 


Electronic publishing 
circumvents environmental 
issues caused by paper use and 
the shipment of heavy journals. 
But more thrift is needed to 
reduce the energy consumed by 
Internet servers, which already 
accounts for 2% of global energy 
production (see, for instance, 
go.nature.com/dmqn9a). 

Large video and PDF files are 
downloaded and distributed by 
e-mail, often thousands of times. 
Minimizing the size of such files 
would reduce server energy 
usage and allow easier access by 
people in developing countries 
and rural areas that have slow 
Internet connections. 

Server energy is also wasted 
in distributing figures at 
unnecessarily high resolution. 
Governments and public 
organizations are particularly 
guilty (see, for example, an 
11-megabyte PDF from the 
European Research Council: 
go.nature.com/Iwcuyx). 

More scientific journals 
should ask authors to submit 
their manuscripts with low- 
resolution illustrations, which 
can then be upgraded for 
publication. Publishers of 
open-access journals might even 
consider offering a discount 
on publication charges for 
manuscript files that are smaller 
than, say, one megabyte. 

David Gurwitz Tel-Aviv 
University, Israel. 
gurwitz@post.tau.ac.il 


Tourism is least of 
cetaceans’ problems 


Ecotourism boats could indeed 
be harming dolphins and whales, 
for example by interrupting
their foraging behaviour (see
Nature 512, 358; 2014). But
many whale-watchers do right
by the animals and follow good
practice. The major threats 
to cetaceans are still hunting, 
fishing, military sonar, undersea 
explosives and pollution. 

According to the International 
Whaling Commission, there 
were just 32 collisions with 
ecotourism boats, resulting in 
4 whale fatalities, between 1885 
and 2010 (see go.nature.com/vehx93).
Private boaters who do
not follow proper guidelines are a 
problem. And ecotourism boats 
frighten Icelandic minke whales 
(Balaenoptera acutorostrata) 
probably because the animals 
confuse them with Iceland’s 
commercial whale-hunting 
vessels. 

Even where ecotourism 
boats have damaged cetacean 
populations — as in New 
Zealand’s Doubtful Sound, where 
tours caused bottlenose-dolphin 
numbers to fall by 11 between 
1997 and 2005 — these declines 
pale in comparison to the 
hundreds of pilot whales that are 
butchered each year in the Faroe 
Islands, or to the similar numbers 
of dolphins slaughtered in Taiji, 
Japan. Moreover, some 300,000 
cetaceans die every year as a result 
of entanglement in fishing gear 
(see go.nature.com/bh2wl7). 
Dale Frink Rancho Santa 
Margarita, California, USA. 
dalefrink@gmail.com 


Sanctions derail 
wildlife protection 


Blanket economic sanctions
on politically unstable regimes
that are rich in biodiversity 
deny local people access to 
international funding for wildlife 
conservation and management 
(see A. Waldron et al. Proc. Natl 
Acad. Sci. USA 110, 12144- 
12148; 2013). More-targeted 
restrictions could secure major 
biodiversity gains for relatively 
minor investment. 

The Red Sea coral-reef 
ecosystems of Sudan, for 
example, are among the 
world’s healthiest, with robust 
populations of top predators;
in South Sudan, one of the
largest migrations of terrestrial 
mammals occurs each year. The 
international community should 
recognize the importance of 
such unique ecological attributes 
and help to safeguard them 
through adequate funding and 
research. 

Terrorism, war and human- 
rights atrocities in Sudan’s Darfur 
province are rightly condemned, 
but the remaining 75% of the 
country is relatively peaceful. 
Indeed, wildlife protection 
brings socio-economic benefits 
that help to alleviate poverty 
and resolve conflict (see 
W. M. Adams et al. Science 306, 
1146-1149; 2004). Conservation 
should not be derailed by 
sanctions. 

Nigel Hussey University of 
Windsor, Ontario, Canada. 
nehussey@uwindsor.ca 


Great crested grebe 
usurps badger 


Michael Brooke's charming 
centennial reappraisal of Julian 
Huxley’s Courtship Habits of 
the Great Crested Grebe (Nature 
513, 484; 2014) missed an 
opportunity to mention the 
starring role these birds had in 
Evelyn Waugh's 1938 satirical 
novel Scoop. 

In this novel, Priscilla 
Boot — sister of nature writer 
William Boot — mischievously 
meddles with one of William's 
newspaper columns, swapping 
“badger” throughout for “great 
crested grebe”. The essay is duly 
printed — with the bird as its 
protagonist. 

A prodigious correspondence 
ensues: one letter asks whether 
the author condones the 
practice of baiting these rare 
and beautiful birds with terriers; 
another challenges him to 
produce a single authenticated 
case of a great crested grebe 
attacking young rabbits. 

Roger C. Prince Stonybrook 
Apiary, Annandale, New Jersey, 
USA. 

rogercprince@gmail.com 
EVOLUTIONARY BIOLOGY


Survival of the fittest group 


Experiments with social spiders find that colony size and composition affect colony survival in a site-specific manner, 
indicating that natural selection on group-level traits contributes to local adaptation. SEE LETTER P.359 


TIMOTHY LINKSVAYER 


Organisms often seem remarkably well
adapted to their environment, as a
result of evolution by natural selection.
Natural selection is usually thought to act at 
the individual level — when the survival or 
reproduction of individuals depends on their 
own traits — but it can also act at other levels, 
from genes to social groups to populations.
For example, the survival and reproduction 
of individuals or groups of individuals may 
also depend on group-level traits, resulting 
in group selection. On page 359 of this issue, 
Pruitt and Goodnight present a rare experimental
study from a group-selection perspective
in natural populations, providing evidence
that strong group selection on the colony traits 
of social spiders may drive adaptation to local 
conditions. 

The idea that natural selection at the group 
level is evolutionarily important and can 
produce group-level adaptations has been 
contentious and associated with semantic 
debate and confusion. This debate has been
especially heated regarding the relationship 
between group selection and kin selection — 
another form of natural selection. The concept 
of kin selection was originally formulated by 
evolutionary biologist W. D. Hamilton to 
explain the evolution of altruism, which seems
paradoxical from an individual's perspective, 
because, by definition, it involves fitness costs 
to the individual and benefits to the group. 
Yet from a gene’s perspective, altruism can be 
thought of as being selfish when beneficiaries 
are relatives (and thus share genes), because 
the gene experiences a net benefit by effectively 
helping itself to spread in the population. Sub- 
sequently, Hamilton and others showed that 
the evolution of social traits such as altruism 
can equivalently be understood as being driven 
by the balance between group selection and 
individual selection.

Figure 1 | Social spiders. Several species of the spider genus Anelosimus live in kin groups and cooperate
in web construction, care of young and prey capture, as shown here in Anelosimus eximius.

Kin selection and group selection are now
broadly understood to describe the same
evolutionary process from complementary
perspectives. However, although it is clear
that iconic altruistic traits, such as the sterile
worker castes of social insects, have been
shaped by selection between kin groups, actually
studying how selection on group-level
traits influences the evolutionary process in
natural populations has proved difficult.
Pruitt and Goodnight have done this by 
studying Anelosimus studiosus spiders, which 
live in social colonies containing related indi- 
viduals (Fig. 1). Female spiders consistently 
display either docile or aggressive behaviour, 
and colonies vary in the proportion of aggres- 
sive individuals in a site-specific way. To test 
whether these differences in colony compo- 
sition between sites may be shaped by group 
selection on colony composition, the authors 
constructed spider colonies containing dif- 
ferent proportions of aggressive and docile 
females and different group sizes at field sites 
that were either high resource or low resource. 
Each experimental colony was constructed 
from a single source colony. Some colonies 
were composed of ‘native’ individuals (the 
colonies were formed at the same site from 
which the spiders were taken) and others were 
composed of ‘foreign’ individuals (the spiders 
were transplanted from a different site).

The authors then tracked colony survival, 
composition and reproductive output over 
the next two generations, and found that the 
relationship between colony size and group 
composition strongly affects colony survival 
and reproductive success, and that sites with 
high or low resources consistently favour dif- 
ferent relationships. Furthermore, they found 
that, after two generations, surviving colonies 
had shifted their size and composition to be 
more similar to their home site. These results 
suggest that the relationship between group 
size and composition is both heritable and 
locally adapted. Because whole colonies of 
these spiders survive or die depending on 
group traits, group selection is probably 
playing a central part in driving this local 
adaptation. Although the causative agents of 
selection are not known, the authors iden- 
tify two likely culprits for colony extinction: 
egg cannibalism at low-resource sites and an
abundance of parasitic spiders of other species
at high-resource sites.

ALEX WILD

This spider system bears many similarities 
to colonies of social insects. Group selection, 
which in these species translates to colony- 
level selection and in some cases involves 
direct warfare between colonies, is thought 
to shape social insect traits. Like social
spider colonies, social insect colonies are composed
of close kin, which probably explains
why group selection has seemingly had such
strong evolutionary impacts. Although actually
quantifying selection on colony-level
traits of social insects has been elusive, the
social mechanisms that regulate some such
traits, referred to as social physiology, are well
studied in social insects and are understood to
be colony-level adaptations that are shaped by
colony-level selection.

This highlights a key remaining challenge 
in understanding the social spider system. 
The precise social and developmental mecha- 
nisms that enable spider colonies to adaptively 
adjust their colony composition must be char- 
acterized. Pruitt and Goodnight suggest that 
within-colony conflict or the collective ces- 
sation of reproduction are involved, which 
emphasizes another crucial issue: although
the authors present a compelling case for a
strong evolutionary role of group selection, it 
is important to determine whether individual 
selection also operates within colonies. That 
is, does the survival and reproduction of indi- 
viduals within groups depend on whether the 
individual is aggressive or docile? 

Another line of future research will be to 
study the genetic basis of group traits. Adapta- 
tion can occur only when selection acts on her- 
itable traits, so researchers need to understand 
the nature of the genetic architecture of specific 
traits together with patterns of selection on 
these traits. Some of the most persuasive evi- 
dence for the potential evolutionary strength of 
group selection comes from animal-breeding 
studies demonstrating that artificial selection 
at the group level can increase traits associated 
with productivity, even when these traits have 
long been the target of artificial individual-level
selection. The intuitive explanation
for this is that an unintended side effect of 
individual-level selection for productivity is 
the evolution of highly aggressive individuals 
that monopolize resources to the detriment of 
group members. By contrast, group selection 
acts more efficiently on heritable traits under- 
lying positive social interactions, leading to 


Staying together on 
the road to metastasis 


Most deaths from breast cancer occur when the primary tumour spreads to 
secondary sites. It now emerges that clusters of tumour cells that enter the 
bloodstream form metastases more often than single circulating tumour cells. 


ALESSIA BOTTOS & NANCY E. HYNES 


key goal in cancer research is to under- 

stand the mechanisms underlying the 

metastatic process, by which cancer 
cells spread from a primary tumour to other 
sites and form secondary tumours. Patients 
with breast cancer, for example, usually do 
not die from their primary tumours, which 
are surgically removed, but from metastatic 
tumours in organs such as the bone, liver or 
lung. Writing in Cell, Aceto et al.' provide 
information about this complex pathway. 
They find that the cells that escape from 
primary tumours sometimes do so as clusters, 
and that these clusters have a higher ability 
to form lung metastases than single escaped 
cells, despite being present at much lower 
levels in the blood. The authors show that 
this difference stems from the expression of 
a cell-adhesion protein that allows the cells 
to stick together in clusters, thereby providing
a survival advantage in the lungs.

Blood-borne circulating tumour cells 
(CTCs) that have broken away from primary 
tumours were described more than 30 years 
ago, but technological limitations have until 
recently made it challenging to study them. 
With the advent of improved methods to 
detect, quantify and isolate CTCs’, we now 
know that these cells have tumour-forming 
ability** and that CTC numbers have prognos- 
tic significance in many types of cancer (see 
ref. 5 for a review). 

Considering that there are many more CTCs 
in the blood than there are metastatic tumours, 
it is of interest to discover the characteristics of 
a successful circulating cell. By mixing tumour 
cells labelled with different colours, Aceto and 
colleagues generated multicoloured primary 
breast tumours in mice, which allowed them 
to visualize multicoloured CTC clusters in the 
blood and lungs. Quantification revealed that 
even though cell clusters made up less than

the evolution of groups of amiable animals
with high overall reproductive success¹¹. The
ability of group selection to more effectively
act on heritable social traits may also underlie
the seemingly strong evolutionary influence
of group selection in Pruitt and Goodnight’s
study and earlier studies.


Timothy Linksvayer is in the Department 
of Biology, University of Pennsylvania, 
Philadelphia, Pennsylvania 19104, USA. 
e-mail: tlinks@sas.upenn.edu

3% of total CTC ‘events’ in the blood, more
than 50% of the metastases were derived from 
clusters. The authors also found that CTC 
clusters represent aggregates of cells originating 
from the tumour and entering the vasculature, 
rather than tumour cells that clump together in 
the circulation to form clusters. 
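
The arithmetic behind this comparison can be sketched as follows. As a rough illustration only, assume that every CTC 'event' (a single cell or a cluster) is counted once and that each metastasis arises from exactly one event; neither assumption is stated by the authors. The function name is hypothetical, and the boundary values of 3% and 50% stand in for "less than 3%" and "more than 50%", so the result is best read as an approximate lower bound rather than a reported figure.

```python
def relative_potential(cluster_event_fraction, cluster_metastasis_fraction):
    """Per-event metastatic potential of clusters relative to single CTCs.

    Assumes (for illustration) that each metastasis comes from one event.
    """
    f = cluster_event_fraction
    m = cluster_metastasis_fraction
    if not (0 < f < 1 and 0 < m < 1):
        raise ValueError("fractions must lie strictly between 0 and 1")
    per_cluster = m / f              # metastases per unit of cluster events
    per_single = (1 - m) / (1 - f)   # metastases per unit of single-cell events
    return per_cluster / per_single


# Boundary values taken from the text: clusters are 3% of events, 50% of metastases.
ratio = relative_potential(0.03, 0.50)
print(f"Clusters are roughly {ratio:.0f} times as likely to seed a metastasis")
```

Under these assumptions a cluster is on the order of 30 times as likely as a single cell to give rise to a metastasis, which is why such a rare population can account for most of the secondary tumours.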

To strengthen their conclusions, the authors 
compared the ability of single CTCs and CTC 
clusters to generate lung metastases directly. 
Following injection into the tail vein of mice, 
cells from both populations efficiently reached 
the lungs, but the single CTCs underwent high 
levels of apoptotic cell death, whereas the CTC 
clusters survived much better and thus formed 
metastases more often. 

The results from these experimental models 
are exciting, but do they hold up in patients 
with cancer? To look at the prognostic signifi- 
cance of CTC clusters, the researchers moni- 
tored the blood of patients with advanced 
metastatic cancer. CTCs were found in 68% of
the patients; among these, clusters were con- 
tinuously detected in 5.6% of the cases. Despite 
the rarity of the clusters, their continued pres- 
ence was correlated with a significantly shorter 
period without disease progression in patients 
with breast cancer and with reduced overall 
survival in patients with prostate cancer. 

To identify the molecular mechanisms 
governing CTC-cluster formation, Aceto and 
colleagues used devices called "**CTC-iChips® 
to isolate single CTCs and CTC clusters from 
the blood of patients with breast cancer, and 
then sequenced RNA transcripts from the cells. 
Although there were no large gene-expression

abundance of parasitic spiders of other species
at high-resource sites. 

This spider system bears many similarities 
to colonies of social insects. Group selection, 
which in these species translates to colony- 
level selection and in some cases involves 
direct warfare between colonies, is thought 
to shape social insect traits®. Like social 
spider colonies, social insect colonies are com- 
posed of close kin, which probably explains 
why group selection has seemingly had such 
strong evolutionary impacts. Although actu- 
ally quantifying selection on colony-level 
traits of social insects has been elusive’, the 
social mechanisms that regulate some such 
traits, referred to as social physiology, are well 
studied in social insects and are understood to 
be colony-level adaptations’ that are shaped by 
colony-level selection’. 

This highlights a key remaining challenge 
in understanding the social spider system. 
The precise social and developmental mecha- 
nisms that enable spider colonies to adaptively 
adjust their colony composition must be char- 
acterized. Pruitt and Goodnight suggest that 
within-colony conflict or the collective ces- 
sation of reproduction are involved, which 
emphasizes another crucial issue: although 


the authors present a compelling case for a 
strong evolutionary role of group selection, it 
is important to determine whether individual 
selection also operates within colonies. That 
is, does the survival and reproduction of indi- 
viduals within groups depend on whether the 
individual is aggressive or docile? 

Figure 1 | Circulating tumour cells and their clusters. Circulating tumour cells (CTCs) are cells that 
escape from a primary tumour and enter the bloodstream, which carries them to distant organs where they 
can form metastatic tumours. Although most CTCs are single cells, occasional CTC clusters are detected 
in the blood of patients; the presence of these clusters, which show high expression of the adhesion protein 
plakoglobin, is correlated with worse patient prognosis. Using in vivo mouse models of breast-cancer 
metastases, Aceto et al.' show that CTC clusters originate from groups of cells in the primary tumour that 
are held together by plakoglobin. The authors also demonstrate that CTC clusters have a greater potential 
to form lung metastases than single CTCs, owing to a survival advantage in the lungs. 


differences in the cells originating from the 
two populations, expression of some cluster- 
associated genes, including that encoding the 
protein plakoglobin, was increased in cells 
from clusters. 

Plakoglobin was discovered more than 
30 years ago as an adhesion protein associ- 
ated with the cell membrane. Although it has 
previously been implicated in cancer, its role 
is controversial (see ref. 7 for a review). For 
example, one study proposed plakoglobin as 
a tumour-suppressor protein, whose expres- 
sion is downregulated by epigenetic (non- 
DNA-sequence-changing) modulation in 
some tumours’, whereas another suggested 
that it has a tumour-inducing effect’. It is 
possible that plakoglobin has different roles in 
different cancer types, but from Aceto and col- 
leagues’ results it seems clear that the protein 
acts as a metastasis promoter in breast cancer. 
Indeed, in an in silico analysis of publicly 
available data sets from almost 2,000 patients, 
the authors correlated high plakoglobin levels 
with a significant reduction in metastasis-free 
survival time. 

Returning to mouse studies, the researchers 
found that reducing plakoglobin expression in 
breast-cancer cells resulted in the destruction 
of CTC clusters and reduced their metastatic 
ability. These data suggest that plakoglobin 
expression in tumour cells is responsible 
for the formation of CTC clusters and that 
plakoglobin-mediated clustering provides 
these cells with a survival advantage when they 
reach the lungs (Fig. 1). 

But how does plakoglobin work? Is its effect 
mediated by adhesion activity and/or by 
controlling signalling pathways in the cell? 
Plakoglobin is a homologue of the protein 
β-catenin, and competes with it for binding
to E-cadherin (another adhesion protein)
and transcription factors of the TCF family. 
These proteins are involved in the Wnt 
signalling pathway, which is abnormally active 
in several types of breast cancer. Perhaps 
plakoglobin influences signalling by antagonizing
β-catenin and thereby affects Wnt
signalling in breast-cancer cells. This and other 
hypotheses will need to be investigated. 

It will also be of interest to assess whether 
plakoglobin is a drug-targetable molecule in 
metastases. Reduction of plakoglobin expres- 
sion did not affect the dissemination of single 
CTCs to the lungs in Aceto and colleagues’ 
mouse studies, and these cells, although 
they have less metastatic power than CTC
clusters, can still colonize distant organs.
Thus, identifying molecular targets that are 
common to all CTC populations remains a 
goal for future studies. 

Although several questions about CTCs are 
still open, the efforts currently being expended 
on characterizing these cells are motivated by 
their clear clinical relevance’. Thanks to the 
feasibility of blood sampling and the fact that 
CTCs can be detected even in the early stages 
of primary cancer, these cells could be excel- 
lent biomarkers for early diagnosis. CTCs and 
CTC clusters could also be quantified to moni- 
tor treatment responses and the progression of 
metastatic disease. Moreover, considering the 
genetic differences reported between primary 
tumours and metastases”, choosing effective 
therapies to treat metastases might be better 
informed by analysing CTCs, the cells capable 
of disseminating to distant organs, rather than 
by analysing primary tumour cells.


How tiny galaxies
form stars


Observations of two faint galaxies with a low abundance of elements heavier 
than helium show that the galaxies have an efficiency of star formation less than 
one-tenth of that of the Milky Way and similar galaxies. SEE LETTER P.335 


BRUCE ELMEGREEN 


Star formation is well studied in bright
galaxies such as the Milky Way, where it 
occurs by localized gravitational collapse 
in dense cold clouds. The clouds are mostly 
composed of gaseous atomic and molecular 
hydrogen and helium, with a small fraction
of the clouds’ mass being in the form of dust
particles made from condensed ‘metals’ (ele- 
ments heavier than helium). Trace amounts of 
carbon, oxygen and other elements also make 
heavy molecules, such as carbon monoxide. 
Dust is important for star formation because 
it prevents most starlight from getting inside 
the clouds, allowing heavy molecules to radiate 


S.D. VAN DYK/IPAC/CALTECH/NOAO 


away their heat and cool to 
the point at which gravity 
overcomes gas pressure and 
stars are made. Astrono- 
mers know much less about 
star formation in small faint 
galaxies, which is the subject 
of the paper by Shi and col- 
leagues’ on page 335 of this 
issue. 

Small galaxies have a 
lower abundance of heavy 
elements (metallicity) than 
large galaxies, and their 
average gas densities are 
also low, making gravity 
inside the clouds relatively 
weak and dust absorption 
of starlight relatively small. 
As a result, star formation 
seems to be slow, as Shi et al. 
report. The basic difference 
between small and large gal- 
axies is their rotation speed, 
which is related to mass. The 
Milky Way rotates rapidly 
around its centre, with a 
speed of more than 200 kilo- 
metres per second’. Rapid 
rotation means that gravity 
binds our galaxy tightly, that the speed at 
which material can escape the galaxy is high, 
and that metal-rich debris from stellar winds 
and supernova explosions gets trapped in the 
interstellar medium. This debris is the material 
that has been processed by nuclear reactions 
inside stars and is the origin of all heavy ele- 
ments. After a cosmic age of stars forming and 
dying, the trapped metals add up to a few per 
cent of the total mass in gas and stars for galax- 
ies the size of the Milky Way. This is enough for 
the resulting dust to block starlight from cloud 
interiors and allow molecules to form, cool and 
collapse into stars’. 

By contrast, the two galaxies studied 
by Shi and co-workers, Sextans A (Fig. 1) and 
ESO 146-G14, have low rotation speeds (23 and 
70 km s⁻¹, respectively) and masses that are
only 0.2% and 13% of the Milky Way’s mass*”. 
These galaxies are too tiny to have trapped 
most of their heavy elements from a lifetime 
of supernovae, and indeed the abundance of 
heavy elements in these galaxies relative to 
hydrogen is less than 10% of that in the Milky 
Way*”. Weak gravity also means that they have 
low gas pressures, on average. As a result, we do
not expect molecules to form in dense clouds, 
and so the presence of young stars in these 
galaxies is a puzzle. 

Shi and colleagues’ study bypasses the 
molecules and looks for the associated dust 
instead. The problem with low-metallicity 
galaxies is that their molecular gas is very dif- 
ficult to observe. Molecular hydrogen at the 
low temperatures required for star formation 
does not emit radiation efficiently, and carbon 


Figure 1 | A dwarf irregular galaxy. Sextans A is a small faint galaxy spanning about 
1,500 parsecs. In this view, the stars of Sextans A (blue) lie beyond the bright Milky Way 
foreground stars (yellow). 


monoxide, the next most abundant molecule, 
is rare when both carbon and oxygen are rare’. 
The dust mixed with the gas can be detected, 
however, because it radiates in the infrared 
regime of the electromagnetic spectrum, 
at wavelengths between 10 and 1,000 μm.
Detection requires a large telescope on a satel- 
lite because Earth’s atmosphere absorbs most 
infrared light and makes cosmic sources nearly 
invisible from the ground. Shi et al. combined 
observations of dust from the Herschel and 
Spitzer infrared space telescopes with obser- 
vations of regions that contain hot young stars 
from an ultraviolet space observatory, the 
GALEX Space Telescope, to determine the dust 
masses and star-formation rates in Sextans A 
and ESO 146-G14. The authors also derived 
the mass of atomic hydrogen from archival 
ground-based radio observations”. 

The main result of Shi and colleagues’ study 
is that there is much more infrared light than 
would be expected for the atomic hydrogen and 
star-formation rates that are present in these 
two galaxies. More infrared means that there 
is more dust than anticipated, and much more 
gas considering the low abundance of heavy 
elements there. Moreover, this gas has to be 
molecular because not enough atomic hydro- 
gen is observed. Shi et al. conclude that a large 
mass of unseen molecules has to be present 
near the observed regions of star formation. 
However, then there is a problem with the rates 
at which star formation occurs, which should 
be ten times larger than they are if the effi- 
ciency of star formation, the rate per molecule, 
is the same as in the Milky Way and similar
galaxies. The reasons for these
peculiarities are unknown. 
Previous models’ for low 
star-formation rates in such 
galaxies were based on mol- 
ecules being prevented from 
forming in the first place, 
but that is apparently not 
happening in Sextans A and 
ESO 146-G14. 

Astronomers should 
understand much more 
about molecular clouds and 
star formation at low lev- 
els of metallicity in the next 
few years. A new interfero- 
metric telescope, the Atacama 
Large Millimeter/submilli- 
meter Array (ALMA) in 
Chile, was designed to detect 
faint emission from mol- 
ecules and dust’*. Now in its 
third year of observations, 
ALMA is powerful enough 
to map even rare molecules 
such as carbon monoxide at 
the sparse elemental abun- 
dances of faint galaxies. This 
would allow the temperatures, 
densities and motions of the 
star-forming gas to be determined from spectral 
signatures of the molecules. 

Sextans A and ESO 146-G14 are examples 
of what galaxies might have looked like in 
the first billion years after the Big Bang. Even 
future Milky Way-like galaxies were small 
then, and had relatively few heavy elements™. 
Understanding star formation in the smallest 
galaxies of our own backyard may give us considerable
insight into the earliest star formation
in the Universe.


The origin of human
retinoblastoma 


The cellular origins of most human cancers remain unknown, but an analysis of 
embryonic retinal cells identifies differentiating cones as the cell of origin for the 
childhood cancer retinoblastoma. SEE LETTER P.385 


ROD BREMNER & JULIEN SAGE 


An enduring mystery in our effort to
understand most human cancers is the
identities of the cells from which they
arise. Attempts to define these ‘cells of origin’ 
have often used markers that are expressed in 
advanced tumours as a reference point. How- 
ever, because cancer cells have, by definition, 
undergone a transformation from a normal to 
a diseased state, this approach is fatally flawed. 
By analogy, passengers disembarking from 
an aeroplane wearing winter clothes might 
look as if they had boarded in a cold country, 
but they could equally be arriving in a wintry 
location having set off from somewhere 
warm. In this issue, Xu et al.’ (page 385) take 
an alternative approach to the cell-of-origin 
problem, identifying the cell type that gives rise 
to retinoblastoma by studying normal cells in 
the human retina. 

Retinoblastoma is a childhood cancer of 
the retina that often serves as a model sys- 
tem for cancer studies. Indeed, work on this 
cancer led to the seminal discovery of the 
RB1 gene’, which encodes the retinoblastoma 
tumour-suppressor protein RB. To investigate 
the cell of origin of retinoblastoma, Xu and 
colleagues manipulated human embryonic 
retinal cells, and found that precursor cells 
destined to become cone photoreceptors are 
unusually sensitive to the loss of RB1. The fact 
that cone precursors are differentiating cells 
committed to forming light-sensing retinal 
cells indicates that the cells of origin of human 
cancers do not necessarily have to be stem- or 
progenitor-cell types, as is often posited**. The 
authors purified human cones and showed that 
RB depletion in these cells, but not in other 
retinal populations, causes retinoblastoma 
when the cells are transplanted into recipi- 
ent mice — a finding that resolves decades of 
debate? (Fig. 1). 

These data are compelling, but live imaging 
of early tumours from patients’ eyes shows 
that lesions occur in the ‘inner nuclear layer 
of the retina’. This is the middle of three strata 
that comprise the retina, but cones are located 
in the outermost layer. Retinoblastomas 
might grow from displaced cone precursors 
in the inner nuclear layer. Alternatively, it 
is conceivable that a lack of RB causes a cell
in the inner nuclear layer to change its fate
to become a cone or cone-like cell, because 
differentiating retinal cells are plastic. The 
susceptibility of purified cones to division 
and transformation following the loss of RB 
suggests that this idea is unlikely, although 
one should bear in mind that the cells used in 
these experiments have been dislodged from 
their normal milieu. There is also precedence 
for fate change in other cell lineages after 
RB loss’. 

What do the current results mean for mouse 
models of cancer? Mice are better protected 
from retinoblastoma than are humans — other 
tumour-suppressor genes must be deleted in 
genetically engineered mice in addition to 
the Rb1 gene to cause the cancer to develop*”. 
As in humans, RB loss causes abnormal divi- 
sion of differentiating mouse retinal cells, but 
whereas Xu and co-workers observed that only 
cones are significantly affected in the human 
retina, all neuronal cell types are perturbed 
in that of the mouse*”. The cell of origin for 
mouse retinoblastoma is also a differentia- 
ting retinal neuron, although of the amacrine 
(interneuron) lineage rather than the cone 
lineage’. 

Amacrine and cone cells are generated in 
the retina at around the same developmen- 
tal stage, and may thus share aspects of their 


Cone-precursor cell 


rvyvyy" 


Retinal cells 


gene-expression circuitry, especially early in 
their development. Indeed, the gene-expres- 
sion patterns in human and mouse retino- 
blastoma are similar'’, and there are also 
parallels in the genetic mutations that they 
harbour, such as deletions in the CDKN2A 
tumour-suppressor gene’. Furthermore, 
amacrine cells are located in the inner nuclear 
layer of the retina, where retinoblastoma 
emerges in humans. Thus, although there are 
differences in retinoblastomas between the two 
species, the numerous similarities make mouse 
models a valuable tool for future research and 
therapeutic testing. 

One central issue in retinoblastoma and 
many other familial cancers is the striking 
specificity of tumour development. Why 
do patients with mutations in RB1 develop 
tumours specifically in the eye before the 
age of five, even though the gene is expressed 
everywhere? The answer may lie in this latest 
study, and in previous observations made by 
the same group”. It seems that the molecular 
circuitry that is present in cone precursor cells 
renders them uniquely sensitive to cancerous 
transformation when RB is lost. 

For instance, Xu et al. found evidence to 
suggest that high levels of the ubiquitin ligase 
enzymes SKP2 and MDM2, and of the cancer- 
causing protein N-Myc, are crucial for cone 
precursors to begin proliferating without 
undergoing programmed cell death. Mouse 
amacrine cells seem to have similar circuits 
that sensitize them to the loss of RB, including 
the ability to resist cell death driven by the 
transcription factor E2F — a normal result of 
E2F expression following loss of RB function 
in mouse retinal cell types’. One interesting 
exception is the p107 protein, a relative of RB 
that has a tumour-suppressor role in mice’ 
but which the current study indicates can 
promote the development of cancer in 
human cone precursors harbouring RB1
mutations. 


Figure 1 | The cone stands alone. Human retinal progenitor cells give rise to seven distinct cell types. 
Retinoblastoma develops specifically from differentiating cone precursors, owing to the molecular 
circuitry in these cells, which includes high expression of N-Myc, SKP2 and MDM2 proteins. This 
expression pattern permits the cells to proliferate and undergo a cancerous transformation when the 
tumour-suppressor protein RB is lost. In other retinal cell types, loss of RB either has no detectable effect
or induces cell death (not shown).

In conclusion, Xu and colleagues’ fantastic
work solves a controversial issue and provides 
a proof of principle for similar studies in other 
solid tumours. Once again, retinoblastoma 
acts as a model for the cancer field. Knowl- 
edge of the cell of origin for retinoblastoma 
(and other cancers) may help researchers to 
develop approaches for better diagnosis, earlier 
detection, and possibly chemoprevention. In 
addition, a better understanding of the molec- 
ular circuitry that renders cells susceptible to 
cancerous transformation may help to uncover 
Achilles heels in tumour cells.


A historic experiment
redesigned


Large quasiparticles known as Rydberg excitons have been detected in a
natural crystal of copper oxide. The result may find use in applications such as 
single-photon logic devices. SEE LETTER P.343 


SVEN HÖFLING & ALEXEY KAVOKIN


An exciton is a quasiparticle in a solid-state
system comprising an electron
and a hole (the absence of an electron).
It has an energy spectrum akin to that
of a hydrogen atom, and so may be con- 
sidered as an artificial hydrogen atom in a 
solid-state environment, with the hole play- 
ing the part of the hydrogen’s proton. The 
concept of excitons was first formulated in 
the early 1930s by Yakov Frenkel’, who pre- 
dicted their existence in molecular crystals. A 
few years later, Gregory Wannier’ and Nevill 
Mott’ described these electron-hole bound 
states for inorganic semiconductors. In 1952, 
Evgeniy Gross and Nury Karryjew’ discovered 
these Wannier–Mott excitons experimentally
in a copper oxide (Cu₂O) semiconductor.
Now, on page 343 of this issue, Kazimierczuk 
et al.” report how they have redesigned this 
historic experiment to find excitons in a natu- 
ral crystal of copper oxide. The excitons extend 
across some tens of billions of lattice sites of 
the crystal. 

Gross and Karryjew’s discovery marked the 
beginning of ‘excitonics’ — an area of solid- 
state physics that holds promise for applica- 
tions in optoelectronics and in information 
and communication technologies’. For their
studies, Gross and Karryjew selected crystals
of copper oxide, and, using a spectrograph, 
identified eight dark lines in the material's 
transmission spectrum. Such absorption dips 
indicated the energies of optically induced 
transitions from the crystal’s ground state to 
excited states with principal quantum numbers 
n=2, 3,... 9 (Fig. 1). The transition energies 
scaled with n in a similar way to those of 
a hydrogen atom. This result proved that 
hydrogen-like quasiparticles, excitons, can be 
generated in these semiconductor crystals by 
photoabsorption. 
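
The hydrogen-like scaling can be made concrete with a short sketch. It assumes the textbook Rydberg form, E_n = E_gap - Ry/n², in which the absorption lines converge on the band gap from below. The function name and the numerical values for the gap and the exciton Rydberg energy are placeholders chosen for illustration, not figures from the article.

```python
def exciton_energy(n, gap_ev, rydberg_ev):
    """Energy (eV) of the n-th level in a hydrogen-like exciton series."""
    if n < 1:
        raise ValueError("principal quantum number must be >= 1")
    return gap_ev - rydberg_ev / n**2


# Placeholder values for illustration only (assumptions, not from the article).
GAP_EV = 2.17
RYDBERG_EV = 0.09

# The range n = 2 ... 9 mirrors the eight lines described in the text.
for n in range(2, 10):
    e_n = exciton_energy(n, GAP_EV, RYDBERG_EV)
    spacing = exciton_energy(n + 1, GAP_EV, RYDBERG_EV) - e_n
    print(f"n={n}: E_n={e_n:.5f} eV, spacing to n+1={spacing * 1e3:.3f} meV")
```

In this model the spacing between neighbouring lines shrinks roughly as 2Ry/n³, so lines at higher n crowd ever closer together and become progressively harder to separate.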

In their study, Kazimierczuk et al. performed 
high-resolution transmission spectro- 
scopy of an extremely high-quality natural 
crystal of copper oxide found at the Tsumeb 
mine in Namibia using laser light of tunable 
frequency and ultralow spectral linewidth 
(corresponding to roughly 1.2 megahertz). 
Taking advantage of the narrow linewidth of 
the laser and the high purity of the crystal, the 
authors have measured transmission spectra 
of the material with a spectral resolution of 
5 nanoelectronvolts — an extremely high value 
for optical spectroscopy experiments. Analysis 
of the spectra revealed absorption lines asso- 
ciated with excitons with principal quantum 
numbers as large as n = 25. The size of an exciton
increases as n², with n = 25 corresponding to a

50 Years Ago


In general, the ‘epidemic’ process
can be characterized as one
of transition from one state
(susceptible) to another (infective)
where the transition is caused by
exposure to some phenomenon
(infectious material) ... People
are susceptible to certain ideas
and resistant to others. Once an
individual is infected with an idea
he may in turn, after some period
of time, transmit it to others. Such a
process can result in an intellectual
‘epidemic’ ... The development of
the psychoanalytic movement in the
early part of the twentieth century
was in its way no less an ‘epidemic’
than was the outbreak of influenza
in 1917 and 1918. One can argue
similarly that Darwin and evolution,
Cantor and set theory, Newton
and mechanics, and so on, were
examples of ‘epidemics’ in the world
of scientific thought which were
instigated by the introduction of a
single infective into a population.
From Nature 17 October 1964 


100 Years Ago 


At the present time astronomers 
have no available organisation 
by which the news of important 
astronomical discoveries can be 
quickly distributed to the leading 
observatories of the world, nor is 
there a bureau with which anyone 
making an important discovery can 
immediately communicate with 
the knowledge that the news will at 
once be circulated world wide. This 
condition of affairs is due to the fact 
that the recognised Central Bureau is 
at Kiel, in Germany, and that the state 
of war prevents the circulation of any 
such news ... There is little doubt that 
if the Royal Astronomical Society of 
Great Britain would undertake ... the 
task of receiving and disseminating 
astronomical information, this act 
would meet with the approval of 
astronomers all the world over. 
From Nature 15 October 1914 
Figure 1 | The spectral signatures of excitons. The diagram shows the spectral absorption dips (black) 
associated with the hydrogen-like transitions of Wannier–Mott excitons in a copper oxide crystal. In their 
historic 1952 experiment, Gross and Karryjew identified the eight dips that correspond to transition 
energies Eₙ with n = 2, 3, ... 9. The transition with energy E₁ is forbidden by optical-selection rules and 
cannot be observed. Kazimierczuk et al. have observed excitons in a natural crystal of copper oxide with 
n as large as 25. The energies are given in electronvolts. 


huge quasiparticle filling a sphere of diameter 
more than 2 μm, about ten times the 
wavelength of the light needed to create this 
exciton (see Fig. 1 of the paper). Moreover, 
the authors found that excitons that formed at 
different locations in the crystal had, within 
the experimental accuracy of the measure- 
ments, identical spectral lines. This observa- 
tion confirmed the extraordinary quality of 
the sample. 
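
The n² size scaling quoted above can be turned into a quick estimate. The sketch below is only a back-of-envelope illustration: it anchors the scaling to the figure given in the text (a diameter of about 2 μm at n = 25, which the text states as a lower bound), and the function name and default arguments are hypothetical.

```python
def exciton_diameter_um(n, ref_n=25, ref_diameter_um=2.0):
    """Estimate an exciton's diameter (micrometres) assuming size grows as n**2.

    The reference point (about 2 micrometres at n = 25) is the value quoted
    in the text; treating it as exact is an assumption of this sketch.
    """
    if n < 1:
        raise ValueError("principal quantum number must be >= 1")
    return ref_diameter_um * (n / ref_n) ** 2


for n in (2, 5, 10, 25):
    print(f"n={n}: ~{exciton_diameter_um(n):.4f} micrometres")
```

Doubling n quadruples the estimated diameter, which is why the highest observed states are so much larger than the n = 2 to 9 excitons of the 1952 experiment.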

Such enormous quasiparticles, called 
Rydberg excitons, exhibit unusual quantum 
phenomena. One of them is giant diamag- 
netism, which is related to the excitons’ ability 
to counteract an applied magnetic field. This 
results in a blueward shift of the excitons’ 
spectral lines. Giant diamagnetism in copper 
oxide was first observed by Gross and 
colleagues, and the present experiments 
confirm this. Another phenomenon observed 
by Kazimierczuk and co-workers is Rydberg 
blockade, which manifests as a reduction in 
excitonic absorption with increasing laser 
power for lines that correspond to large n. 
Rydberg blockade means that only a limited 
number of large Rydberg excitons is per- 
mitted within a given volume of the crystal. 
The effect could be used to make nonlocal 
all-optical switches and single-photon 
logic devices. 

Kazimierczuk and colleagues’ discovery of 
giant Rydberg excitons opens up new avenues 
for the field of excitonics. Furthermore, the 
standard descriptions of light-matter inter- 
action, in a regime in which the size of the 
exciton resulting from the interaction exceeds 
the wavelength of light used to create it, need 
to be revised. The long-standing assumption 
that extra boundary conditions for the exciton 
wavefunction, such as Pekar boundary 
conditions, must be included in the classical 
calculation of optical excitations in crystals acquires 
new importance in view of the nonlocal optical 
properties introduced in the crystals by giant 
Rydberg excitons: the light-induced creation 
of such an exciton at point A in a crystal may 
strongly modify the crystal’s optical 
properties at point B if B is separated from A by 
more than 1 μm. Also, the fine structure of 


the absorption lines associated with excitons 
of large n may be complex and unusual owing 
to multiple emission and absorption of 
virtual photons by the excitons and interactions 
between the spins and orbital momenta of the 
excitons’ electrons. 

Finally, with n=25 having now been 
achieved, the question arises of whether 
researchers might be able to observe excitons 
with principal quantum numbers as large 
as n=50, for which the exciton would have 
a diameter of about 1 mm — and so would, 
in principle, be visible to the naked eye. 


Answering this question would require 
Kazimierczuk and colleagues’ experiment 
to be performed at millikelvin temperatures 
(the present experiments were done at 1.2K), 
with a spectral resolution of about 1 neV. This 
might seem challenging, but is perhaps not 
impossible. 


Of monarchs 
and migration 


The genomes of 101 monarch butterflies from migratory and resident populations 
have been sequenced, revealing genes and molecular pathways that underlie 
insect migration and coloration. SEE ARTICLE P.317 


RICHARD H. FFRENCH-CONSTANT 


In 1902, Rudyard Kipling wrote the Just So 
Stories, which explained, in colourful 
terms, how the leopard got his spots and 
the camel got his hump. Half a century later, 
Francis Crick and James Watson discovered 
the molecular structure of DNA, but despite 
the molecular revolution that followed, we 
still struggle to explain many examples of 
natural selection at more than a ‘just so’ level. 
Now, owing to the application of modern 
DNA sequencing to systems other than mice 
and humans, butterflies are leading a 
renaissance in our understanding of the 
molecular basis of natural selection. In this issue, 
Zhan et al. (page 317) sequence a remarkable 
101 butterfly genomes, and tell a story of two 
parts: migration and coloration. 
During the summer, the monarch butterfly 
(Danaus plexippus) searches for milkweed 
plants on which to lay its eggs. Monarch cater- 
pillars acquire cardiac glycoside compounds, 
which are toxic to predators, from the plant. 
These compounds are stored in both the 
caterpillars and the butterflies, which display 
the warning colours orange, black and white 
(Fig. 1). In the autumn, North American 
monarchs migrate and congregate in trees in 
the Mexican mountains. Tropical monarchs 
are not strictly migratory, although popula- 
tions do make short-range migrations in the 
dry season. 

Which evolved first, the temperate migra- 
tory populations or the resident tropical 
groups? For birds, the ‘southern home’ theory 
suggests that migratory populations arose 
from non-migratory tropical populations’. 
Surprisingly, Zhan and colleagues’ analysis 
of migratory and non-migratory monarchs 


STEPHEN DALTON/NATUREPL.COM 


shows that these butterflies originated in 
North America, from a migratory ancestor. 
Tropical groups have less genetic diversity 
than their North American relatives, because 
they have gone through step-wise genetic 
bottlenecks during their colonization of the 
tropics, each of which reduced the diversity 
of their genomes. 

Even more unexpectedly, the authors’ 
analysis of monarch DNA suggests that migra- 
tory ability is linked to a single gene, encoding 
the protein collagen IV subunit α-1. 
Collagen IV is essential for the formation and 
efficient function of muscles. When Zhan 
et al. undertook a detailed analysis of evolu- 
tionary selection patterns within this gene, they 
found evidence to suggest that alteration of a 
single amino acid affects the ability of the 
collagen subunits to co-assemble or tri- 
merize, perhaps conferring on migratory 
monarchs some undisclosed advantage for 
long-distance flight. 

What about the butterflies’ warning 
coloration? Historically, vertebrate and inver- 
tebrate pigmentation have been viewed as dis- 
tinct. For example, although the tiny coloured 
scales on the wings of butterflies are known 
to be related to flies’ bristles, there has been 
little evidence to suggest that the processes 
of pigmenting a butterfly-wing scale and a 
mouse hair involve the same genetic play- 
ers. Zhan and colleagues have shed the first 
light on this subject by examining the ghostly 
white ‘nivosus’ monarchs — a variant found 
on the island of Oahu in Hawaii. 

Kipling might have said that the white 
monarchs got their colouring from roosting 
on the peaks of the ancient volcanoes Waianae 
and Koolau when they were capped with snow. 
But by sequencing nivosus and orange butter- 
flies from Oahu, the authors show that a single 
Figure 1 | In-flight warning. The monarch butterfly displays orange, white and black warning colours. 


gene is strongly associated with the change in 
colour. That gene encodes a myosin protein 
related to the mammalian myosin 5a — a two- 
headed motor protein that can ‘walk’ along 
filaments of another protein, actin, within the 
cell. Myosin 5a acts as a transporter of the 
light-absorbing pigment melanin, dispersing 
melanin-containing structures called 
melanosomes along actin filaments. Melanosomes are 
cellular subunits that both synthesize and dis- 
play melanin, giving colour to human hair and 
to the coats of other mammals. In mice, muta- 
tions in myosin 5a render the protein unable 
to properly transport melanosomes, resulting 
in a diluted coat colour. 

Although the presence of melanin-contain- 
ing pigment granules in the cuticles of some 
butterfly larvae has been documented, it is 
unclear exactly how these granules relate to 
the melanosomes found in most vertebrates. 
Precisely how a myosin protein might shunt 
pigment-containing structures around the 
scale of a butterfly wing therefore remains a 
mystery. What is clear, however, is that other 
structural cellular components, such as helical 
actin filaments, are involved in generating 
the microribs and lamellae, structures on 
wing scales that interfere with the wavelength 
of light and thus cause iridescence through 
refraction. This fact, taken together with 
the current study, suggests that structural 
proteins within the wing scale might play a 
part in both pigment- and refraction-based 
coloration. 

Zhan et al. have taken us beyond the realms 
of Kipling’s stories. The authors have shown 
how current sequencing technologies can 
allow us to look directly at the traits under 
strong selection pressure in the species in 
which selection is actually acting, rather than 
just mice and fruit flies in the laboratory. 
But, of course, any good study raises more 
questions than answers. 

What are the genes that prompt the migratory 
behaviour in North American monarchs? For 
example, does the monarch begin its migra- 
tion in response to shortening day length in 
North America? The authors suggest that 
the molecular mechanisms contributing to 
migration are probably complex, and that the 
pathways involved range from those regulat- 
ing circadian rhythms to those that control 
navigation. However, the gene that their study 
highlights is central to efficient muscle func- 
tion. Although this finding suggests that it is 
the monarchs’ muscles that allow them to 
perform their migration, we still do not 
understand the complex role of shortening day 
length and environmental sensing in instruct- 
ing the butterflies to migrate at a certain 
time of year. 

Zhan and colleagues do not describe the 
mutation in the myosin-5a-like gene that 
causes the white nivosus variant, even though 
the answer probably lies within the sequenced 
mutant genomes. It is noteworthy that muta- 
tions within the globular tail domain of myo- 
sin 5a (its putative melanosome-binding site) 
result in a lighter coat colour in mice. Does 
the nivosus-associated mutation also lie in this 
globular tail and therefore somehow disrupt 
its binding of butterfly pigment granules? 
Furthermore, the dilute-coat mutant in mice 
arises owing to the integration of a virus into 
the mouse genome, and full coat colour can 
be restored by virus excision. Bizarrely, this 
suggests that, if similar mutations occur in 
the butterfly myosin-5a-like gene, they might 
be unstable, and the butterflies may therefore 
seem to spontaneously appear and disappear 
like white ghosts. 
The genetics of monarch butterfly 
migration and warning colouration 


Shuai Zhan, Wei Zhang, Kristjan Niitepõld, Jeremy Hsu, Juan Fernández Haeger, Myron P. Zalucki, Sonia Altizer, 
Jacobus C. de Roode, Steven M. Reppert & Marcus R. Kronforst 


The monarch butterfly, Danaus plexippus, is famous for its spectacular annual migration across North America, recent 
worldwide dispersal, and orange warning colouration. Despite decades of study and broad public interest, we know little 
about the genetic basis of these hallmark traits. Here we uncover the history of the monarch’s evolutionary origin and 
global dispersal, characterize the genes and pathways associated with migratory behaviour, and identify the discrete 
genetic basis of warning colouration by sequencing 101 Danaus genomes from around the globe. The results rewrite our 
understanding of this classic system, showing that D. plexippus was ancestrally migratory and dispersed out of North 
America to occupy its broad distribution. We find the strongest signatures of selection associated with migration centre 
on flight muscle function, resulting in greater flight efficiency among migratory monarchs, and that variation in monarch 
warning colouration is controlled by a single myosin gene not previously implicated in insect pigmentation. 


Every year millions of monarch butterflies fly from the northern United 
States and southern Canada to overwinter in Mexico. Notably, during 
this portion of the annual migration, individual butterflies emerge as 
adults in the north, fly thousands of kilometres south, overwinter for 
months in reproductive diapause, and finally begin mating and flying 
north in the spring. Recolonization of northern latitudes takes place 
over the course of three to four subsequent generations, after which it 
is late summer again and the process repeats itself. Although most North 
American monarchs overwinter in Mexico, those that live west of the 
Rocky Mountains generally overwinter along the California coast. 
Monarch migration has been studied extensively over the past few decades, 
with particular emphasis on tracking migration routes and 
overwintering sites, as well as characterizing the navigational mechanisms that 
guide this complex behaviour. A largely unappreciated aspect of this 
system is that not all monarchs migrate. In fact, the geographic 
distribution of D. plexippus extends far beyond North America and it does 
not migrate across most of its range. For instance, while the monarch 
undergoes extensive, long-distance migration across North America, it 
also exists throughout Central America, South America and the 
Caribbean, where it does not migrate. Furthermore, the monarch has 
recently dispersed to many locations around the globe where it does not 
exhibit the same long-distance migration found in North America. 

The molecular genetic mechanisms that contribute to migration are 
likely to be complex, spanning pathways related to navigation and cir- 
cadian rhythms, environmental sensing, energy production and meta- 
bolism, thermal tolerance, reproduction and longevity, neuromuscular 
development, and phenotypic plasticity. Because D. plexippus exists 
as both migratory and non-migratory populations, we sought to use 
comparative population genomics to characterize the genetic basis of 
migration by identifying genome-wide targets of divergent natural selec- 
tion associated with shifts in migratory behaviour. To do this, we sequenced 
the genomes of 89 butterflies—80 D. plexippus and nine samples from 
four additional Danaus species—from across their worldwide distribution 


(Fig. 1a, b) and analysed them using the monarch reference genome 
sequence. 

To guide our analysis, we first characterized the evolutionary origin 
of D. plexippus and its history of dispersal. The genus Danaus is broadly 
tropical and non-migratory, which suggests that non-migratory 
populations of D. plexippus in South and Central America may represent 
the ancestral source of the North American population, similar to the 
‘southern home theory’ for the origin of seasonal migration in birds. 
More recently, monarchs dispersed across the Pacific, throughout 
Oceania and Australia, and they also dispersed across the Atlantic to Europe 
and Africa. It is unknown whether the Pacific and Atlantic dispersal 
events were independent, but both are thought to derive from North 
American migratory monarchs. There is also an enigmatic 
non-migratory population in south Florida, which may result from 
overwintering migratory butterflies that fail to return north. 


Evolutionary and demographic history 

Our analysis, based on genome-wide single nucleotide polymorphism 
(SNP) variation (approximately 32 million SNPs), revealed a history 
quite distinct from our a priori expectations. For instance, we recovered 
North American populations as the most basal lineages, with Central/ 
South America, Pacific and Atlantic populations each forming an 
independent, derived lineage (Fig. 1c, d; Extended Data Fig. 1). We also found 
that all geographic sampling locations were genetically distinct, except 
those in North America, providing evidence of worldwide population 
structure but gene flow across North America (Fig. 1e, f). An exception 
was the non-migratory population in south Florida, which was distinct 
from other North American populations. We also found evidence for 
very recent dispersal between the North American migratory 
population and adjacent non-migratory populations (Supplementary 
Information). Our results suggest that the monarch butterfly originated in North 
America from a migratory ancestor, a scenario consistent with the 
observation that all monarch populations, as well as Danaus erippus, share 


coalescent (PSMC) model (Extended Data Fig. 2d) and dadi 
(Extended Data Fig. 3) and found a concordant history among all monarch 
populations, but distinct from D. erippus, for most of the last 1 million 
years. Approximately 20,000 years ago, at the end of the last glacial 
maximum, the North American population began to grow, presumably 
fuelled by increasing availability of milkweed host plants throughout 
the Midwestern United States and expanding monarch migration. More 
recently, Atlantic and Pacific populations split from North America, at 
which time both experienced dramatic declines in population size. His- 
Figure 1 | Global dispersal of the monarch butterfly. a, Monarch butterfly 
sampling locations. b, Inferred phylogeny among Danaus species based on 
maximum likelihood analysis of 3,714 single-copy genes. c, Neighbour-joining 
phylogeny of all D. plexippus individuals, based on genome-wide SNP data. 
ATL, Atlantic crossing; CEN, Central America (including south Florida); NOR, 
North America (including Mexico); PAC, Pacific crossing. d, Neighbour- 
joining consensus tree based on 1,000 bootstrap replicates. e, Principal 
component analysis (PCA) plots based on the first two principal components; 
inset shows separation between North America and south Florida. f, Genetic 
structure and individual ancestry; colours in each column represent ancestry 
proportion over range of population sizes K = 2-11. ABW, Aruba; AUS, 
Australia; BLZ, Belize; BMU, Bermuda; CRC, Costa Rica; ECU, Ecuador; ESP, 
Spain; FJI, Fiji; HI, Hawaii; MAR, Morocco; NZL, New Zealand; NCL, New 
Caledonia; PRI, Puerto Rico; PRT, Portugal; s.FL, south Florida; WSM, Samoa. 


reproductive traits and behaviours which may have evolved in the 
context of mass migration. We speculate that the monarch originated in 
the southern USA or northern Mexico, where it originally undertook a 
shorter-distance annual migration. Three subsequent, independent dis- 
persal events led to the monarch’s current broad distribution. Towards 
the south, monarchs expanded from Belize to Costa Rica and into South 
America, as well as offshore, from south Florida to Bermuda and Puerto 
Rico. Westwards, they expanded into Hawaii and then to Samoa and 
Fiji before ending up in New Caledonia, Australia and New Zealand. 
Across the Atlantic, monarchs established first in Portugal and then 
moved to Spain and Morocco. 

Our dispersal scenario was further supported by the observation that 
non-North American populations had elevated linkage-disequilibrium 
(Extended Data Fig. 2a) and minor allele frequencies (Extended Data 
Fig. 2b), indicative of founder effects, and heterozygosity declined along 
each putative dispersal route (Extended Data Fig. 2c), as expected of step- 
wise dispersal. Similar step-wise dispersal is reflected in microsatellite 
markers (A. A. Pierce et al., submitted). The directionality index ψ for 
range expansions also supported North America as the monarch’s 
ancestral origin (Extended Data Table 1). We estimated historical popu- 
lation sizes and divergence times using a pairwise sequentially Markovian 
Figure 2 | A selective sweep associated with migration. a, Distribution of PBS 
and polymorphism in North America (No), calculated in 5-kb sliding 
windows. Migration-associated genomic regions were identified as the points 
above the dashed line (P < 0.01) and to the left of the vertical green dashed 
line (lower quartile). Circled points consist of a single 21-kb region. 
b, Population genetic statistics were plotted across DPSCF300190 in 5-kb 
sliding windows. c, Gene models and SNP allele: white represents homozygous 
for the reference allele; red, homozygous for alternative allele; yellow, 
heterozygous; grey, masked site. 
in the 1800s, but our results indicate an earlier timeframe 
(Supplementary Information). 


Natural selection associated with migration 


We used a modified version of the population-branch statistic (PBS) 
to identify regions of the genome strongly differentiating North American 
monarchs from all three transitions to non-migratory behaviour (Fig. 2a). 
By further limiting this search to regions of low sequence diversity within 
North America, we isolated 5.14 megabases (Mb; 2.1% of the genome), 
encompassing 536 genes, significantly associated (Z-test, P < 0.01) with 
migration. This set was enriched for genes related to morphogenesis, 
neurogenesis, and extracellular matrix/basement membrane. 
Derived-allele frequency was elevated among monarchs in these migration-associated 
genomic regions (Fig. 2a), further suggesting a history of natural 
selection. Derived alleles were similarly enriched in D. erippus, consistent with 
the observation that this species also displays migratory behaviour, 
and suggesting that the common ancestor of D. plexippus and D. erippus 
was migratory as well. 
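
For readers unfamiliar with the population-branch statistic, the sketch below shows the standard three-population form, in which pairwise F_ST values are converted to branch lengths T = -ln(1 - F_ST) and combined as PBS = (T_AB + T_AC - T_BC) / 2 for focal population A. This is the textbook definition, not the modified version used in the study, whose details are not given in this text; the function names are hypothetical.

```python
import math


def branch_length(fst):
    """Convert a pairwise F_ST value into a divergence-time-like branch length."""
    if not 0.0 <= fst < 1.0:
        raise ValueError("F_ST must lie in [0, 1)")
    return -math.log(1.0 - fst)


def population_branch_statistic(fst_focal_b, fst_focal_c, fst_b_c):
    """Standard PBS for a focal population compared with populations B and C.

    A large value means the focal population's branch is long, i.e. allele
    frequencies changed mostly along the focal lineage.
    """
    t_focal_b = branch_length(fst_focal_b)
    t_focal_c = branch_length(fst_focal_c)
    t_b_c = branch_length(fst_b_c)
    return (t_focal_b + t_focal_c - t_b_c) / 2.0


# A window where the focal population differs from both others,
# while the other two populations are nearly identical to each other.
print(population_branch_statistic(0.5, 0.5, 0.0))
```

In a genome scan, a statistic of this kind would be computed per sliding window and the upper tail of the distribution taken as candidate regions.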

We were surprised to find that among the approximately 5 Mb 
associated with migration, a single 21-kilobase (kb) genomic segment stood 
out as an extreme outlier (Fig. 2a). This region showed multiple 
signatures of divergent selection (Fig. 2b) as well as an enrichment of shared 
alleles (ABBA versus BABA sites) among Atlantic, Pacific and Central/ 
South American dispersal events, indicating a shared haplotype among 
all non-migratory populations that was highly divergent from the haplo- 
type in North America. This signature of haplotype sharing among non- 
migratory populations is likely owing to recurrent selection on ancestral 
variation, as opposed to gene flow, because the non-migratory haplo- 
type was present at low frequency in our North American samples (for 
example, sample 203 from New Jersey was heterozygous). 
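
The ABBA versus BABA comparison mentioned above is commonly summarized by Patterson's D statistic. The sketch below is a minimal illustration under simplifying assumptions: one allele per population per site, a four-taxon layout (P1, P2, P3, outgroup) and the outgroup allele taken as ancestral. The function name and input format are hypothetical, and the study's own procedure may differ.

```python
def patterson_d(sites):
    """Compute D = (ABBA - BABA) / (ABBA + BABA) from biallelic sites.

    Each site is a tuple (p1, p2, p3, outgroup) of single alleles. The
    outgroup allele is treated as ancestral ('A'); the other allele is
    derived ('B').
    """
    abba = baba = 0
    for p1, p2, p3, outgroup in sites:
        if len({p1, p2, p3, outgroup}) != 2:
            continue  # skip invariant or multi-allelic sites
        if p3 == outgroup:
            continue  # P3 must carry the derived allele
        if p1 == outgroup and p2 == p3:
            abba += 1  # pattern A B B A: P2 shares the derived allele with P3
        elif p2 == outgroup and p1 == p3:
            baba += 1  # pattern B A B A: P1 shares the derived allele with P3
    if abba + baba == 0:
        raise ValueError("no informative ABBA or BABA sites")
    return (abba - baba) / (abba + baba)


example_sites = [
    ("A", "G", "G", "A"),  # ABBA
    ("A", "G", "G", "A"),  # ABBA
    ("G", "A", "G", "A"),  # BABA
    ("A", "A", "G", "A"),  # uninformative
]
print(patterson_d(example_sites))
```

A value of D well above zero indicates that P2 and P3 share more derived alleles than expected under simple branching, which can reflect either gene flow or, as argued in the text, selection on shared ancestral variation.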

The 21-kb outlier region contained three genes: the F-box protein 
FBXO45, an uncharacterized transmembrane protein, and collagen 
type IV, subunit α-1 (Fig. 2c). By comparing these three genes we found 
evidence for divergent selection on collagen IV α-1 (Fig. 3). Collagen IV 
α-1 showed marked divergence between haplotypes found in 
migratory and non-migratory populations (Fig. 3a, b), apparently resulting from 
an ancient origin of the non-migratory haplotype (Fig. 3c). Collagen IV 
is a central component of basement membranes and essential for 
muscle morphogenesis and function. Mutations in the α-1 subunit result 
in severe myopathy in Drosophila and myopathy-related disease in 
humans. Migratory and non-migratory haplotypes differed by 51 
nucleotide substitutions in the coding sequence of collagen IV α-1, resulting 
in 15 amino acid substitutions. A subsection of the gene showed 
particularly high diversity within D. plexippus, as well as divergence between 
D. plexippus and other species, centred on the single amino acid 
substitution with evidence of positive selection, R1573Q (Fig. 3d). Collagen 
type IV is a heterotrimer composed of two α-1 chains and one α-2 chain, 
which bind at shared triple helix domains. Danaus plexippus collagen IV 
α-1 contains five triple helix domains and the R1573Q substitution 
occurred directly in the middle of one of these, suggesting a functional 
role related to trimerization. Interestingly, we found collagen IV subunit 
α-2 in a nearby genomic window, with reduced but still highly 
significant signatures of selection (Extended Data Table 1), providing 
additional evidence of selection on the interacting members of collagen IV. 


Figure 3 | Divergent selection on collagen IV 
Furthermore, other genomic intervals strongly associated with 
migration overlapped with portions of another well-characterized flight muscle 
gene, kettin (Extended Data Table 2). 

We hypothesized that the signatures of divergence associated with 
these essential muscle genes reflected selection for different flight muscle 
function between migratory and non-migratory butterflies. Consistent 
with this, we found divergent expression of collagen IV α-1 and α-2, but 
not other linked genes, between butterflies from migratory and 
non-migratory populations in adult thoracic muscle tissue (Fig. 3e). 
Surprisingly, collagen IV subunits α-1 and α-2 were downregulated in migratory 
butterflies, leading us to hypothesize that natural selection may be acting 
on aspects of flight efficiency with migratory populations tuned to the 
distinct demands of long-distance flight. This scenario is supported by 
evidence of distinct wing shape and size, body mass and kinematic wing 
loading between migratory and non-migratory populations, but we 
tested it by measuring flight metabolic rates. We found active flight to 
be exceptionally demanding energetically, using 25 times more energy 
than resting. Migrating monarchs are known to offset this to some extent 
by gliding for periods of time on tail winds. Consistent with our 
hypothesis that flight muscle changes have resulted in more efficient energy 
consumption in migratory populations, we found that flight metabolic 
rates were lower in butterflies from a migratory population (Massachu- 
setts), compared to one non-migratory population (south Florida) (Fig. 3f). 
This increase in metabolic efficiency seems to be a result of flight muscle 
performance, because the difference between migratory and non-migratory 
populations was minimal when not in flight (Fig. 3f). Furthermore, we 
found little sequence or gene expression divergence associated with 
glycolytic enzymes (Supplementary Information), which could also 
influence metabolic rates. It is interesting that, although previous work has 
found a link between flight metabolism and dispersal ability in other 
butterfly species, it has always been via glycolysis and generally in 
the form of higher metabolic rates yielding greater dispersal. In con- 
trast, the extreme distances required of monarch migration seem to 
have generated natural selection for reduced flight metabolism, which 
has been mediated by alternate mechanisms. Parallel shifts to the same 
non-migratory collagen IV α-1 haplotype in independent dispersal events 
suggest that an elevated flight metabolic rate may be beneficial in the 
absence of long-distance migration. 


The genetic basis of wing pigmentation 


Another aspect of monarch biology that has attracted attention is their 
bold warning colouration. The monarch butterfly, like Danaus species 
generally, is characterized by bright orange wing colouration that warns 
predators of their toxicity. This colouration also serves to facilitate 
Müllerian mimicry between D. plexippus and the Viceroy butterfly, 
Limenitis archippus. It is not well-appreciated that the monarch is 
polymorphic for wing colouration (Fig. 4a). On Oahu, Hawaii, the white 
‘nivosus’ morph has been documented since the mid 1890s. Previous 
breeding experiments have shown that wing colouration segregates as 
a single, autosomal locus with the white nivosus allele recessive to wild 
type. The nivosus wing colouration and inheritance pattern has led to 
speculation that the mutation probably disrupts the production of orange 
pigment. Red, orange and brown pigments on nymphalid butterfly 
wings are frequently ommochrome pigments, so we hypothesized that 
the nivosus mutation would be found somewhere in the ommochrome 
biosynthesis pathway. 

Our population genomic data provided a means of characterizing 
the monarch colour switch locus at a molecular level. To do this, we 
sequenced 12 additional Hawaiian monarch genomes, five white indi- 
viduals and seven wild-type orange monarchs, all reared at the same 
time, in the same location. Three of these wild-type monarchs were known 
relatives of white monarchs, two were F₁ offspring and one was an F₂ 
offspring of a white parent. By scanning SNP genotypes for segregation 
patterns consistent with Mendelian genetics (Fig. 4b), cross-referencing 
genotypes across the entire set of 101 genomes, and further testing co- 
segregation in crosses (Fig. 4c), we found that markers in one gene, the 
Figure 4 | The genetic basis of warning colouration. a, Although D. plexippus 
is generally bright orange, the nivosus morph lacks orange pigmentation. 
b, A comparison of 12 Hawaiian monarch genome sequences (5 wild-type, 
5 nivosus and 2 F₁ hybrids) reveals perfect SNP associations in one gene, the 
myosin gene DPOGS206617. p, probability of association. c, Comparison of 
DNA sequence divergence (D,,,) between D. plexippus and D. chrysippus shows 
strong purifying selection in exon 2, coinciding with SNP associations in 
modern samples, crosses and field collections from the 1980s. SNP position 785 
is associated in 17/20 samples from the 1980s (P = 0.02, one-tailed Fisher’s 
exact test). 


myosin gene DPOGS206617, were strongly associated with wing colour. 
Interestingly, this gene is homologous to myosin 5a, which is 
responsible for the ‘dilute’ mouse coat colour mutant, a phenotype 
resulting from reduced melanization due to impaired myosin transport of 
melanosomes. This suggests that an alteration in pigment transport, 
and not pigment production, underlies the nivosus form. We speculate 
that this gene, while not previously implicated in insect pigmentation, 
may be an important source of colour pattern variation across the but- 
terfly subfamily Danainae because genera closely related to Danaus are 
dominated by white-winged species, and other Danaus species display 
similar orange versus white variation”. 


Discussion 


We have leveraged exceptional natural variation and extensive genome 
sequencing to characterize the monarch butterfly’s evolutionary origin 
and history of dispersal, genome-wide signatures of divergent selection 
associated with migratory behaviour, and the discrete genetic basis of 
warning colouration. Our results yielded unexpected answers in all three 
aspects. Not only did we re-polarize the ancestral migratory character
state and geographic origin for the monarch, but we also found evidence
for recurrent, divergent selection on flight muscle function during
shifts in migratory behaviour, probably mediated by its role in
influencing flight efficiency. Surprisingly, as monarchs have reverted
to a non-migratory state, which is an ancestral state that pre-dates their
own species and that of their common ancestor with D. erippus, in the
case of collagen IV α-1 at least, they seem to have used old genetic
variation to do so. Furthermore, wing colour variation is mediated by a
gene, the myosin gene DPOGS206617, with no prior known role in insect
pigmentation but with an analogous effect in vertebrates.
Unfortunately, the monarch migration is currently experiencing a 
devastating decline and there is fear that the phenomenon may dis- 
appear entirely. Recent monitoring shows an alarming downward trend 
in monarch numbers from eastern North America, with 2013 marking 
the lowest number of overwintering monarchs in recorded history.
This decline has been driven by multiple factors, including deforestation,
drought and a precipitous drop-off in the number of milkweed
host plants across North America. Our results emphasize the importance
of ongoing conservation efforts to preserve the migration and
extend the extraordinary evolutionary history of this iconic butterfly. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Sampling and sequencing. We sampled a total of 101 butterflies, including 92 
D. plexippus and nine butterflies from other Danaus species (Supplementary Table 1). 
The North American population of monarchs undergoes a yearly migration from the
United States and southern Canada to central Mexico (east of the Rocky Mountains) 
and coastal California (west of the Rocky Mountains). Our sampling sites for the 
migratory group covered several stopover points and the overwintering grounds 
for both eastern and western populations (Fig. 1a, green points). For comparison, 
we sampled residential, non-migratory populations from three major geographic 
regions. In the south, non-migratory populations were sampled from south Florida 
(around the city of Miami), throughout the Caribbean, and Central and South America 
(Fig. 1a, red points). Outside the Americas, we also sampled monarch populations
across the Pacific (Fig. 1a, blue points), including Hawaii and Oceania, and popula- 
tions across the Atlantic (Fig. 1a, purple points), spanning Iberia and North Africa. 

It is important to note that D. plexippus is known to move seasonally outside of 
North America. For instance, seasonal movement through mountain passes has 
been recorded in Costa Rica and seasonal movement between inland and coastal
locations is well known in Australia. While this behaviour is properly referred
to as migration, it is very different from the migration of North American
monarchs and these phenomena are routinely distinguished in the literature. First, these
phenomena differ by orders of magnitude in their scope, with millions to a billion
individuals moving thousands of kilometres in North America and hundreds to
thousands of individuals moving tens to hundreds of kilometres elsewhere. Second,
outside of North America overwintering biology, such as sexual diapause, is highly
labile or not known to exist. Similarly, other Danaus species, such as D. chrysippus,
are well known to move seasonally, but the scale is relatively small in comparison
to D. plexippus in North America. There is, however, evidence that D. erippus migrates
in South America in a way that mirrors D. plexippus migration in North America.
Critically, our genetic data support these expectations and distinctions by showing 
that particular genetic signatures distinguish taxa (populations/species) with differ- 
ing migratory behaviour based on the literature. In our study, we refer to D. plexippus 
populations from North America (excluding south Florida) as ‘migratory’ and 
D. plexippus populations from other geographic locations as ‘non-migratory’ to 
capture this distinction in their biology. 

All samples were sequenced on the Illumina sequencing platform (HiSeq 2000).
Paired-end libraries were prepared using an Illumina paired-end library kit. We
combined between 4 and 8 samples in a sequencing lane (2 × 100 bp) to generate
approximately 10× and 20× raw coverage for D. plexippus and outgroup species,
respectively. A total of 384.6 Gb of paired-end sequence data were generated
(Supplementary Table 2).

Alignment, SNP calling and genotyping. Before mapping, all reads were processed
for quality control and filtered using Seqtk (https://github.com/lh3/seqtk). A total of
366.9 Gb of high-quality read pairs were kept and mapped to the latest version of the
monarch genome assembly (Supplementary Table 2). Based on the result of our preliminary
test, we chose Stampy v1.0.21 as our main mapping software, but we also applied
other mapping methods as independent quality controls (Supplementary Table 3).
A randomly selected subset of reads was mapped in advance to estimate the appropriate
parameters for insert size and substitution rate for each library. Mapping
results were subsequently processed by sorting, indel realignment, duplicate marking,
and low-quality filtering using functions in Picard v1.8 (http://picard.sourceforge.net)
and GATK2. Sequencing coverage and depth for each sample were calculated using
the ‘DepthOfCoverage’ module of GATK2.

Since we had a wide range of sequencing coverage among samples (Supplemen- 
tary Table 2), we carried out SNP calling and genotyping at two separate stages to 
balance the power between low and high coverage samples. We first discovered 
variants on a population-scale using a variety of independent pipelines, which both 
introduced different alignment inputs and covered most popular SNP calling algo- 
rithms (Supplementary Table 3). By comparing methods, we determined a core set 
of SNPs according to the sensitivity and specificity of each pipeline, as well as using 
combinations of pipelines (Supplementary Table 4 and Supplementary Table 5). Using 
the consensus set of SNPs, we went back to each sample to estimate the correspond- 
ing genotype likelihoods from all alignment sources. Based on the comparison, 
genotypes called from the Stampy alignment showed the smallest overall difference
from other independent methods (Supplementary Table 6). We further filtered
out variants from regions with abnormal sequencing coverage and constructed a 
core SNP matrix. In this final data set, 99.1% of the genotypes were independently 
supported by additional evidence, suggesting a reliable input for the subsequent 
population genetic analysis. Also, unlike results obtained using a single SNP scoring 
pipeline, this method substantially reduced the correlation between the identified 
number of SNPs and sequencing depth. 
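
The consensus step described above can be illustrated with a minimal Python sketch that keeps sites reported by at least a chosen number of pipelines. The pipeline names, the site tuples and the `min_support` threshold are hypothetical; the actual core set was chosen from the sensitivity and specificity comparisons in Supplementary Tables 4 and 5, which this sketch does not reproduce.

```python
from collections import Counter


def consensus_snps(call_sets, min_support=2):
    """Return the sites reported by at least `min_support` pipelines.

    call_sets maps a pipeline name to a collection of sites, where each
    site is a hashable tuple such as (scaffold, position, alt_allele).
    """
    counts = Counter()
    for calls in call_sets.values():
        counts.update(set(calls))  # each pipeline votes once per site
    return {site for site, n in counts.items() if n >= min_support}


# Hypothetical call sets from three independent pipelines.
call_sets = {
    "pipeline_a": {("DPSCF300001", 1200, "T"), ("DPSCF300001", 5400, "G")},
    "pipeline_b": {("DPSCF300001", 1200, "T"), ("DPSCF300005", 77, "A")},
    "pipeline_c": {("DPSCF300001", 1200, "T"), ("DPSCF300001", 5400, "G")},
}

core = consensus_snps(call_sets, min_support=2)
print(sorted(core))
```

Raising `min_support` trades sensitivity for specificity, which is the balance the pipeline comparison is meant to tune.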

Outgroup species. Since divergence between D. plexippus and other Danaus species
was high, genotypes within highly divergent regions were likely to be miscalled or
filtered out. We therefore performed de novo assemblies for outgroup species for
analyses that were sensitive to the quality of consensus sequence. Contigs were
assembled for each outgroup sample, and those with a minimum length of 300 bp were
kept and processed with a redundancy filter step as described previously.

We chose the highest quality assemblies (Chry_AUS_113_M, Eres_FL_27M,
Erip_BRA_16005_F, and Gili_FL_28_F) to infer the species phylogeny based on 7,251
single-copy universal orthologues that were identified previously among lepidopteran
genomes (D. plexippus, H. melpomene and B. mori). The OGS2.0 gene
models of D. plexippus were used for homology search by TBLASTN. The high-scoring
pairs with E< 10° were then processed by genblastA v1.0.4 and gene
structures were determined by GeneWise v2.2.0. The 3,714 proteins that were recovered
with ≥50% coverage in all four outgroup species were kept; conserved blocks
were extracted using Gblocks v0.91b and concatenated into seven super genes
with 908,188 amino acids. The species phylogenetic tree was calculated using PhyML
v3 with the JTT model and 100 replicates of bootstrap analyses. Our species tree
resulted in clear separation among all sequenced Danaus species, including the putative
sister species D. plexippus and D. erippus.

Population structure. We used all bi-allelic and high quality SNPs to infer phy- 
logeography and population structure for D. plexippus. For phylogeny, pairwise 
genetic distances were calculated among all samples as described previously and
a tree was subsequently generated (Fig. 1c) using the neighbour-joining (NJ) method
implemented in PHYLIP v3.695. A second frequency tree was also generated
(Fig. 1d) based on 1000 bootstrap replicates using the consense module of PHYLIP. 

We also inferred a population-level phylogeny using the maximum likelihood
approach implemented in Treemix. This method was designed specifically to infer
patterns of population splitting events from genome-wide allele frequency data. For
this analysis, we excluded samples that appeared to have recently dispersed among
our geographic locations and we filtered out singleton SNP sites (MAF <0.05). All
subsequent data analysis was performed with Treemix v1.11 using parameters
“-global” to generate the ML tree (Extended Data Fig. 1).

Population genetic structure and individual ancestry proportions (admixture)
were inferred using FRAPPE v1.1. We increased the pre-defined genetic clusters
from K = 2 to K = 11 and ran the analysis with 10,000 maximum iterations. We also
performed principal component analysis (PCA) using the package EIGENSOFT
v5.0. A Tracy-Widom test was used to determine the significance level of the
eigenvectors.

Based on the complete NJ tree, FRAPPE results and PCA clustering, it was 
immediately clear that three D. plexippus samples had recently dispersed among 
geographic locations. Sample Plex_BLZ_4_M was collected in Belize but clustered 
with North American samples, Plex_FLs_MIA16_M was collected in south Florida 
but clustered with migratory North American samples, and Plex_MA_HI032_M 
was collected in Massachusetts but clustered with Bermuda. Since these are all 
putative exchanges among geographically proximate locations, we suspect they 
represent real dispersal events as opposed to sample mix-ups. It is also worth not- 
ing that Plex_FLs_MIA16_M is the only sample we included from south Florida 
that was collected during the winter, which suggests that North American migratory
monarchs end up in south Florida where they are in contact with the genetically
distinct non-migratory population. This may also explain why this sample,
and no other south Florida samples, emerged as a recent dispersal event. A small 
number of additional North American samples were population outliers in either 
FRAPPE or PCA, suggesting potential admixture or contamination. We removed 
these samples, those with low sequence coverage, and the three recent dispersers 
from subsequent analyses (Supplementary Table 7). We note however that we per- 
formed a second analysis including all samples and found that we were still able to 
clearly identify all regions of the genome strongly associated with migration (below). 

Demographic analysis. We compared patterns of linkage disequilibrium (LD) and
minor allele frequency (MAF) among populations. To estimate linkage disequilibrium,
we calculated r² using Haploview v4.2 with parameters “-maxdistance
160 -dprime -minGeno 0.6 -minMAF 0.1 -hwcutoff 0.001”. Global patterns of LD
and MAF were compared for the four main clusters of monarchs (Extended Data 
Fig. 2a and b). We found high LD and MAF in Atlantic and Pacific populations, 
consistent with inbreeding and population bottlenecks. Central/South America 
was intermediate between these populations and North America. We also esti- 
mated heterozygosity for each sample, calculated as the ratio of heterozygous to 
homozygous variants for each sample (Extended Data Fig. 2c). 
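
For readers unfamiliar with these summaries, the following minimal Python sketch shows how r² between two biallelic SNPs and a per-sample heterozygosity ratio can be computed. It assumes phased haplotypes coded as 0/1 alleles and genotypes coded as alternate-allele counts (0, 1, 2); both encodings and the function names are assumptions for illustration. Haploview works from unphased genotypes and estimates haplotype frequencies itself, so this is not a reproduction of that program.

```python
def r_squared(haplotypes):
    """r^2 between two biallelic SNPs.

    haplotypes is a list of (a, b) pairs, one per phased chromosome,
    where a and b are the 0/1 alleles at the first and second SNP.
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n
    p_b = sum(b for _, b in haplotypes) / n
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return float("nan")  # undefined when either SNP is monomorphic
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium, D
    return d * d / denom


def het_hom_ratio(genotypes):
    """Ratio of heterozygous (1) to homozygous alternate (2) genotypes."""
    het = sum(1 for g in genotypes if g == 1)
    hom = sum(1 for g in genotypes if g == 2)
    return het / hom if hom else float("inf")


linked = [(0, 0), (0, 0), (1, 1), (1, 1)]        # alleles always co-occur
unlinked = [(0, 0), (0, 1), (1, 0), (1, 1)]      # alleles independent
print(r_squared(linked), r_squared(unlinked))
print(het_hom_ratio([0, 1, 1, 2, 2, 1]))
```

Inbred or bottlenecked populations are expected to show higher r² at a given distance and a lower heterozygous-to-homozygous ratio.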

We used the directionality index ψ to test for the occurrence of a range expansion
and to infer the origin of the range expansion (Extended Data Table 1). For
this analysis, we used biallelic SNPs showing consensus genotypes across all outgroup
individuals, and we defined the ancestral allele as that consensus outgroup
allele. After excluding sites where one or both of the two focal populations (S1 and
S2) were fixed for the ancestral allele, we calculated a two-dimensional derived
allele frequency spectrum between populations as described previously.
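
A simplified Python sketch of the idea behind the directionality index is given below. It assumes per-site derived allele frequencies are already available for the two focal populations and takes the mean frequency difference over sites where the derived allele is present in both, so that a negative value means S2 carries derived alleles at higher frequency. The function name and input layout are hypothetical, and the published statistic is computed from the two-dimensional frequency spectrum with equal sample sizes, which this sketch does not handle.

```python
def directionality_index(sites):
    """Mean derived-allele frequency difference (S1 minus S2).

    sites is an iterable of (freq_s1, freq_s2) derived allele
    frequencies. Sites where either population is fixed for the
    ancestral allele (frequency 0) are excluded, as in the text.
    """
    diffs = [f1 - f2 for f1, f2 in sites if f1 > 0 and f2 > 0]
    if not diffs:
        raise ValueError("no sites with the derived allele in both populations")
    return sum(diffs) / len(diffs)


# Hypothetical frequencies; the third site is dropped because S1 is
# fixed for the ancestral allele there.
example_sites = [(0.2, 0.5), (0.1, 0.4), (0.0, 0.9), (0.3, 0.3)]
psi = directionality_index(example_sites)
print(round(psi, 3))
```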

We inferred demographic history for D. plexippus using the pairwise sequentially
Markovian coalescent (PSMC) model. To ensure the quality of consensus
sequence, we only used representative samples of high sequencing depth for each
geographic region. Processed alignments of individuals were transformed to the
whole-genome diploid consensus sequence using SAMTOOLS. Bases of low
sequencing depth (a third of the average depth) or high depth (twice the average)
were masked. We then used “fq2psmcfa” to transform the consensus sequence into
a fasta-like format where the i-th character in the output sequence indicates whether
there is at least one heterozygote in the bin of 20 bp. Parameters were set as follows:
“-p 44+5*3+13*2+5*3+4 -r 2”. The monarch generation time (g) was set as an
estimate of 0.3 years. We used a standard mutation rate (μ) of 8.4 × 10^-9, from
Drosophila. Note, if we use a lower estimate of the mutation rate, all results from
the PSMC and ∂a∂i analyses (below) remain qualitatively the same but inferred
divergence times are older and effective population sizes are larger.
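
The dependence of the inferred times and population sizes on the mutation rate can be made concrete with a short Python sketch of the usual PSMC rescaling, N0 = θ0 / (4μs), years = 2·N0·t·g and Ne = N0·λ, where s is the bin size. The values of `theta0`, `t_k` and `lambda_k` below are hypothetical placeholders rather than results from this study, and the default rate of 8.4e-9 per site per generation is an assumption stated in the code.

```python
def scale_psmc(theta0, t_k, lambda_k, mu=8.4e-9, gen_time=0.3, bin_size=20):
    """Convert scaled PSMC output to years and effective population size.

    theta0   : per-bin scaled mutation rate reported by PSMC
    t_k      : scaled time of an interval (units of 2*N0 generations)
    lambda_k : relative population size in that interval
    mu       : per-site, per-generation mutation rate (assumed value)
    gen_time : generation time in years (0.3 in the text)
    bin_size : bases per bin (20 bp in the text)
    """
    n0 = theta0 / (4.0 * mu * bin_size)
    years = 2.0 * n0 * t_k * gen_time
    ne = n0 * lambda_k
    return years, ne


# Hypothetical PSMC values for a single time interval.
years_std, ne_std = scale_psmc(theta0=0.01, t_k=0.1, lambda_k=2.0)
years_low, ne_low = scale_psmc(theta0=0.01, t_k=0.1, lambda_k=2.0, mu=4.2e-9)
print(years_low / years_std, ne_low / ne_std)
```

Because μ appears only in the denominator of N0, halving the mutation rate doubles both the inferred times and the effective population sizes while leaving the shape of the trajectory unchanged.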

We also inferred the demographic history of the major geographic regions using
diffusion approximation for demographic inference (∂a∂i), which employs SNP
frequency data for populations rather than recombination events within individual
genomes. For this analysis, we analysed the North American population alone and
then considered only pairwise comparisons between North America and each of
the major dispersal populations (South/Central America, Atlantic, Pacific). In an
attempt to avoid selected sites, we only used SNPs from intergenic regions on autosomal
scaffolds. We calculated folded frequency spectra since there is no trinucleotide
substitution matrix that can be used for statistical correction. As suggested, we
specified simple models first and fit the model by increasing complexity gradually.
A likelihood ratio test was used to optimize model selection, with the best model
pictured in Extended Data Fig. 3a, although we note it is hard to rule out more complex
demographic scenarios. Scaled parameters from the most likely model were
transformed using the same g and μ as above. We also performed nonparametric
bootstrapping (100 times) to determine the variance of each parameter (Extended
Data Fig. 3).
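
Folding a frequency spectrum, as mentioned above, removes the need to know which allele is ancestral by counting the minor allele only. The following minimal Python sketch shows the operation for a one-dimensional spectrum; the function name and the example counts are hypothetical, and the ∂a∂i package provides its own spectrum-handling routines that are not used here.

```python
def fold_sfs(unfolded):
    """Fold a one-dimensional site frequency spectrum.

    unfolded[i] is the number of sites at which i of the
    n = len(unfolded) - 1 sampled chromosomes carry the derived allele.
    The folded spectrum is indexed by minor allele count, 0..n//2.
    """
    n = len(unfolded) - 1
    folded = [0] * (n // 2 + 1)
    for i, count in enumerate(unfolded):
        folded[min(i, n - i)] += count  # i and n - i are indistinguishable
    return folded


# Hypothetical spectrum for six sampled chromosomes.
unfolded = [0, 10, 6, 4, 2, 1, 0]
folded = fold_sfs(unfolded)
print(folded)
```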

Historical records suggest that Atlantic and Pacific dispersal events of the monarch
butterfly occurred in the 1800s. These records are largely based on sightings
from early European explorers who did not note the monarch in locations at certain
times and then others who noted the monarch in abundance only years later. Our
demographic analyses based on genome sequence data are inconsistent with this
timing. For instance, our PSMC analyses suggest Pacific and Atlantic dispersal events
may have occurred as early as 2,000-3,000 years ago. Because this timing was unexpected,
we performed a follow-up analysis using ∂a∂i and this also yielded split
times of 2,000-3,000 years ago between North America and the Atlantic and Pacific
populations (Extended Data Fig. 3). Interestingly, the ∂a∂i analysis further indicated
recent bottleneck recovery in the past 200-500 years, perhaps pointing to trans-oceanic
dispersal events that were initially seeded thousands of years ago but which
spread widely only within the last 200 years. This may provide some link between
the genetic data and the historical records. If our demographic inference based on
the genomic data is correct, where has the monarch been for all this time? The
most common monarch host plants are recent introductions in the Pacific but
there are native host plants in Southeast Asia. In addition, there is apparently a long
history of the monarch butterfly in New Zealand. For instance, the indigenous Maori
people of New Zealand believe the monarch is native to New Zealand and, unlike
in other Pacific locations, they have a traditional name for the monarch butterfly.
On the Atlantic side, it is possible the monarch has co-occurred with its congener
D. chrysippus in North Africa and on the Canary Islands, using the same host plants.
We stress that the ancient Atlantic and Pacific dispersal scenarios we outline here
are speculative, but plausible, and they would be in line with our genomic results.

Identification of migration-associated genomic regions. We applied a sliding
window approach (5-kb windows sliding in 500-bp steps) to identify genomic
regions associated with migration. Several statistical features were considered and
compared. Based on the evolutionary scenario, we employed a modified population
branch statistic (PBS) approach, which originally showed power to detect incomplete
selective sweeps over short divergence times, a scenario that is highly relevant
here. Our approach was to search for genomic regions separating North America
from all three independent losses of migration (South/Central America, Atlantic,
Pacific). Based on the original PBS algorithm, we specifically modified the formula
as PBS = (T^NC + T^NP + T^NA − T^CP − T^A) / 3, where T^AB is the log-transformed
FST between populations A and B, etc. (A, Atlantic; C, Central/South America;
N, North America; P, Pacific). We further restricted this to windows in the lowest
quartile of the distribution of pairwise nucleotide polymorphism (π, ref. 69) within North
America. At a significance of P < 0.01 (Z test), we identified a total of 5.14 Mb
(2.1%) of the genome, including 536 predicted genes, which were associated with
migration (Supplementary Table 8). If we did not restrict our gene set by low π in
North America, our list of migration-associated genes included an additional 154
genes (Supplementary Table 8). We calculated a variety of other statistics, including
Tajima’s D, LD, difference in the number of ABBA versus BABA sites, and
derived allele frequency (DAF) for each sliding window. To estimate DAF, we inferred
ancestral alleles using the consensus sequence of the outgroup taxa. We found that
DAF was notably enriched in our migration-associated genomic regions, relative
to the rest of the genome, and this was true in both D. plexippus and D. erippus.
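
To make the window scan concrete, the following Python sketch generates 5-kb windows in 500-bp steps and computes the original three-population PBS on which the modified statistic is based, using T = −log(1 − FST). It deliberately does not implement the four-population modification described above. The function names and the example FST values are hypothetical.

```python
import math


def sliding_windows(length, size=5000, step=500):
    """Yield half-open (start, end) windows; only full-size windows."""
    for start in range(0, length - size + 1, step):
        yield start, start + size


def branch_length(fst):
    """Log-transform FST into a divergence-time-like quantity."""
    return -math.log(1.0 - fst)


def pbs(fst_ab, fst_ac, fst_bc):
    """Original three-population PBS for focal population A."""
    return (branch_length(fst_ab) + branch_length(fst_ac)
            - branch_length(fst_bc)) / 2.0


windows = list(sliding_windows(6000))
print(windows)

# Hypothetical per-window FST values: the focal population is strongly
# differentiated from both others, which barely differ from each other.
print(round(pbs(0.5, 0.5, 0.0), 4))
```

A large PBS in a window indicates that the focal population sits on a long branch there, which is the signal the scan looks for.
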

Annotation. We annotated genes in migration-associated genomic regions using
the monarch OGS2.0 gene models and related information from MonarchBase
(Extended Data Table 2, Supplementary Table 8). We additionally annotated the
genes within functional categories based on the corresponding Drosophila melanogaster
orthologues using the DAVID online platform v6.7. Functional enrichments
are presented in Supplementary Table 9 and Supplementary Table 10. We also specifically
examined PBS and gene expression in glycolysis enzymes (Supplementary
Table 11).

Targeted gene analysis. We performed a targeted population genetic analysis of
three adjacent genes on scaffold DPSCF300190: FBXO45, DPOGS206536, and collagen
IV α-1. For each gene, CDS was extracted for samples from the population
resequencing data and average pairwise sequence divergence and FST were estimated
between migratory and non-migratory populations using Arlequin v3.5. We used
the program Network v4.612 (Fluxus Engineering) to generate gene networks and
MEGA v6 to infer a maximum-likelihood gene tree for collagen IV α-1 under the
GTR+I+G model. We used DnaSP v5.10 to compare sequence polymorphism
among the three genes using the Hudson-Kreitman-Aguadé test. We also performed
sliding window analyses of sequence polymorphism (π) and between-species
(D. plexippus, D. chrysippus) divergence (Dxy) along collagen IV α-1 CDS using
DnaSP. Codon-based models of adaptive protein evolution were implemented
using the Datamonkey webserver (http://datamonkey.org/). Specifically, we ran
FUBAR, MEME, and FEL tests on an alignment of lepidopteran collagen IV
α-1 sequences. The average dN/dS across collagen IV α-1 was 0.16 but all tests were
suggestive of positive selection on the R1573Q substitution while other sites were
found to be under negative or no selection (FUBAR: dN = 10.5, dS = 1, normalized
dN/dS = 9.5, Bayes factor = 87; MEME: dN(β+) = 110 (P[β = β+] = 0.46),
dS = 0.47, P = 0.08; FEL: dN = 55, dS = 10° ©, normalized dN/dS = 83, P = 0.10).

RNA-seq. We extracted total RNA from the adult thoracic muscle tissue of six 
D. plexippus samples, one male and one female from Massachusetts, Hawaii, and 
Costa Rica. The samples from Hawaii and Costa Rica were from non-migratory 
populations and the samples from Massachusetts were from a non-migratory summer 
generation of the migratory North American population. RNA-seq libraries were 
prepared using an Illumina TruSeq protocol, individually indexed and sequenced 
on one lane of HiSeq 2000. After QC processing of raw data, differential gene expression
of target genes was analysed using TopHat v2.0.7 and Cufflinks v2.1.1.

Respirometry. We measured resting and flight metabolic rates using flow-through
respirometry. The material consisted of 60 two-day-old females, originating from
Massachusetts (n = 19), south Florida (n = 19), and Costa Rica (n = 22). The samples
from south Florida and Costa Rica were from non-migratory populations and
the samples from Massachusetts were from a non-migratory summer generation
of the migratory North American population. Individuals were placed in a 2-liter
cylindrical, transparent respirometry chamber and covered with a dark cloth. We
pumped dried, CO2-free air through the chamber and used a mass flow meter and
controller (Sierra Instruments, Monterey, CA, USA) to regulate the STP-corrected
flow rate of 1.8 l min^-1. We used a regularly calibrated differential, infrared CO2
analyser (Li-Cor 7000, Li-Cor, Lincoln, NE, USA) to measure the CO2 emission rate.
We converted the signals from analogue to digital using a Sable Systems Universal
Interface (UI-2) and collected the data using ExpeData (Sable Systems, Reno, NV,
USA). After the individual had rested motionless in the darkened chamber for approximately
15 min and the CO2 emission curve had reached a stable baseline, we started
recording resting metabolic rate (RMR). We recorded a minimum of 10 min of
stable, undisturbed RMR. The measurements took place in a temperature-controlled
room. We used an NTC thermistor probe (Sable Systems) to continuously
measure the temperature inside the chamber. The mean temperature across the
RMR measurements was 32.2 °C (s.e.m. 0.07).

After the RMR recording, we removed the dark cloth and exposed the butterfly
to bright light (two 25 W ultraviolet/visible light bulbs and one 26 W fluorescent
light bulb). We allowed the butterfly to adjust to the light for 30 s, after which we
began to stimulate it to fly by sharply shaking the chamber until the individual
took off. Once it alighted, we shook it up into the air again. The shaking continued
for 10 min with the aim of producing continuous flight. After 10 min, we
stopped the stimulation and covered the chamber with the dark cloth. We allowed
the CO2 curve to return to the baseline. We performed an instantaneous or
Z-correction on all metabolic rate data to compensate for delayed washout of CO2
in the respirometry chamber. To characterize flight performance, we focused on
total flight metabolic rate (FMR), which consists of the total volume of CO2 produced
during the 10-min flight experiment.
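
The washout correction and the integration of CO2 output can be sketched in Python as follows. The sketch uses one common form of the instantaneous correction for an ideally mixed chamber, in which each reading is extrapolated to its equilibrium value using the fraction 1 − exp(−(flow/volume)·Δt), and a trapezoidal sum for the total. Treating the 2-liter chamber as its effective volume, the sampling interval, and the function names are all assumptions; the acquisition software used by the authors may apply the correction differently.

```python
import math


def instantaneous_correction(signal, dt_min, flow_l_min=1.8, volume_l=2.0):
    """Correct a CO2 trace for washout in a well-mixed chamber.

    signal     : readings taken every dt_min minutes
    flow_l_min : flow rate through the chamber (litres per minute)
    volume_l   : effective chamber volume in litres (assumed)
    """
    z = 1.0 - math.exp(-(flow_l_min / volume_l) * dt_min)
    corrected = [signal[0]]
    for prev, cur in zip(signal, signal[1:]):
        corrected.append(prev + (cur - prev) / z)
    return corrected


def total_volume(rate, dt_min):
    """Trapezoidal integral of an emission rate sampled every dt_min."""
    return sum((a + b) / 2.0 * dt_min for a, b in zip(rate, rate[1:]))


# A chamber responding to a sudden, sustained emission of 1 unit lags
# behind exponentially; the correction recovers the step.
k, dt = 1.8 / 2.0, 0.5
lagged = [1.0 - math.exp(-k * dt * i) for i in range(6)]
recovered = instantaneous_correction(lagged, dt)
print([round(v, 6) for v in recovered])
print(total_volume([2.0, 2.0, 2.0], 5.0))
```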

For analysis, we compared the three populations using analysis of covariance
(ANCOVA). Time of the day, temperature and body size (pupal mass) were added
as covariates to the models. There were significant differences in RMR among the
three populations (F2,51 = 6.11, P = 0.004). Pupal mass had a positive effect on RMR
(F1,51 = 14.98, P = 0.0003). There was a marginally significant quadratic time effect,
suggesting that RMR peaked in early afternoon (time: F1,51 = 4.09, P = 0.048; time²:
F1,51 = 4.03, P = 0.0499). The measurement temperature main effect was non-significant
(F1,51 = 1.08, P = 0.30) but there was a significant interaction between
population and temperature (F2,51 = 6.30, P = 0.004), suggesting a positive relationship
between temperature and RMR in Florida and Costa Rica, but a negative
relationship in Massachusetts. A Tukey’s HSD (honest significant difference) test
revealed no significant RMR differences in pairwise comparisons among populations
(P > 0.05 in all pairwise comparisons). FMR differed among populations
(F2,56 = 6.43, P = 0.003). Pupal mass had a positive effect on FMR (F1,56 = 9.38,
P = 0.0034). A Tukey’s HSD test showed that mass-independent FMR was different
between Massachusetts and south Florida (P = 0.0025) but not between
Massachusetts and Costa Rica (P = 0.5837).

Association mapping of colour gene. We extracted genomic DNA from 12 individuals
(Supplementary Table 1) and constructed Illumina paired-end libraries
using the Illumina TruSeq protocol. The 12 libraries were indexed, pooled into 2
lanes and sequenced using an Illumina HiSeq 2000. Only high-quality reads that
passed the QC step were used for downstream analyses. Genome resequencing data
were aligned to the D. plexippus reference genome sequence using Bowtie2 v2.1.0
with the parameter --very-sensitive-local and then were re-ordered and sorted by
Picard v1.84 (http://picard.sourceforge.net). RealignerTargetCreator and IndelRealigner
in GATK v2.1 were used to realign indels and UnifiedGenotyper was
used to call genotypes across the 12 individuals using the following parameters:
heterozygosity 0.01, stand_call_conf 50, stand_emit_conf 10, dcov 250. A total of
10,034,303 SNPs and 1,434,642 indels supported by more than 10 individuals and with
good quality (Q > 30) were kept for further analysis. Association tests were performed using
PLINK v1.07 and variants with P < 0.005 (Fisher’s exact test) were selected. Candidate
loci were checked using customized scripts, and those that showed strong linkage
patterns (containing >10 associated variants) within gene regions and were consistent
with the known genotypes were selected for PCR verification (Supplementary Table 12).
We also repeated this analysis after excluding 4 of the 12 samples with lower genome
sequence coverage and the results were identical, yielding the highest genome-wide
SNP associations in the myosin gene DPOGS206617.
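
The association test can be illustrated with a small, self-contained Python implementation of a one-tailed Fisher's exact test on a 2 × 2 table of phenotype against allele carrier status. The counts below are hypothetical and chosen only to show the calculation; PLINK performs the test per variant on allele counts, and this sketch does not reproduce its input handling or options.

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """P(X >= a) for the 2 x 2 table [[a, b], [c, d]].

    Rows are phenotype classes and columns are carrier / non-carrier
    counts. Under the null hypothesis, the top-left cell follows a
    hypergeometric distribution with the margins held fixed.
    """
    row1, col1, n = a + b, a + c, a + b + c + d
    total = comb(n, col1)
    upper = min(row1, col1)
    tail = sum(comb(row1, k) * comb(n - row1, col1 - k)
               for k in range(a, upper + 1))
    return tail / total


# Hypothetical example: an allele carried by all 5 white individuals
# and by none of 5 orange individuals.
p_perfect = fisher_one_tailed(5, 0, 0, 5)
# A weaker, also hypothetical, association.
p_weak = fisher_one_tailed(3, 1, 1, 3)
print(p_perfect, p_weak)
```

With only a handful of individuals per class, even a perfect association yields a modest P value, which is why the text combines the genome-wide scan with co-segregation in crosses and historical specimens.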

We further tested potentially associated polymorphisms in families and field-collected
samples provided by J. Stimson. We extracted genomic DNA from
58 historic specimens (Supplementary Table 13) and performed whole-genome 
amplification using GenomePlex Complete Whole Genome Amplification Kit 
(Sigma). We designed primers to span polymorphisms on five potentially associated 
scaffolds (Supplementary Table 12). Amplification efficiency was low because of 
the age of these specimens. However, after Sanger sequencing all positive PCR pro- 
ducts and subsequent analysis of genotypes, amplified region 1013900-1013989 
yielded a very strongly associated SNP, position 785 in the myosin gene DPOGS206617 
(Supplementary Table 12 and Supplementary Table 14). This section of DPOGS206617 
is very likely to house the causative variation responsible for monarch colour vari- 
ation because it contains SNPs perfectly associated with colour in our full-genome 
sequence data, a SNP very strongly associated in our targeted analysis of historical 
specimens, a signature of long-term purifying selection over evolutionary time, 
and notably, the white-associated alleles in this region are derived alleles that are 
found in no other monarch sample among our 101 sequencing panel. 

Extended Data Figure 1 | Relationships among monarch populations
inferred using the maximum likelihood method implemented in Treemix.
Note, this is a fully resolved, bifurcating tree. The very short basal branches
indicate little genetic drift in North American populations, not unresolved basal
relationships. Colours correspond to those in Fig. 1. Treemix also inferred five
migration events among populations: from Puerto Rico to Aruba, from Puerto
Rico to Costa Rica, from New Caledonia to Fiji, from Belize or Costa Rica
to Portugal, and from Belize to Puerto Rico.

Extended Data Figure 2 | Demographic history of the monarch butterfly.
a, Patterns of linkage-disequilibrium decay across the genome in different
geographic populations. b, Genome-wide distribution of minor allele
frequencies. c, Heterozygosity across populations, estimated as the ratio of
heterozygous SNPs to homozygous SNPs/individual. d, Demographic history
inferred using PSMC. This analysis includes representative individuals of high
sequencing depth for each geographic location. The period of the last glacial
maximum (LGM; ~20,000 years ago) is shaded in grey.

Extended Data Figure 3 | ∂a∂i analysis parameter estimates. a, Schematic
of demographic scenario modelled in ∂a∂i labelled with parameters being
estimated. Nu, effective population size (individuals); m, migration rate
(individuals/year); T, time (years). b, Inferred parameter estimates. c, One-dimensional
model-data comparison considering North America population
only. In the left panel, the model is plotted in red and the data in blue. In the
right panel, the residuals between model and data are plotted. d, Two-dimensional
comparison for joint estimation of North America and dispersal
populations (Central/South America, Pacific, Atlantic). The left two panels are
marginal spectra for data and the maximum-likelihood model, respectively.
The right two panels show the residuals.

Extended Data Table 1 | Inferring the monarch range expansion

S1               S2                 FST       ψ
North America    south Florida      0.0472    -0.4219
                 Caribbean          0.0960    -0.8639
                 Central America    0.0473    -0.4765
                 South America      0.1263    -1.0493
                 Pacific            0.0929    -1.4988
                 Atlantic           0.1054    -1.0268

A positive ψ indicates that population S1 is farther away from the origin of the expansion than S2 whereas a negative value indicates that S2 is farther from the origin of the range expansion. North America (S1)
includes the United States and Mexico but excludes south Florida; Caribbean includes Bermuda and Puerto Rico; Central America includes Belize and Costa Rica; South America includes Aruba and Ecuador;
Pacific includes Hawaii, Samoa, Fiji, New Caledonia, Australia and New Zealand; Atlantic includes Portugal, Spain and Morocco.
Extended Data Table 2 | Top 20 migration-associated genomic regions

Scaffold | Position (kb) | PBS | Involved genes
DPSCF300190 | 186.5-207.5 | 0.406 | FBXO45; Transmembrane protein; Collagen alpha-1(IV)
DPSCF300134 | 1-61.5 | 0.086 | IGF-II mRNA-binding protein; Insulin-like growth factor 2; 2 monarch hypothetical proteins
DPSCF300190 | 158.5-186 | 0.081 | dipeptidyl-peptidase; 2 WD-40 transcription factors
DPSCF300001 | 3799.5-3811.5 | 0.080 | Acyltransferase 3
DPSCF300005 | 105.5-119 | 0.067 | Lepidopteran hypothetical protein
DPSCF300001 | 4632.5-4644 | 0.064 | Kettin; pleiotrophin-like protein
DPSCF300014 | 323.5-337 | 0.063 | RNA-binding protein; Pre-mRNA-splicing factor Cwf15; 2 universal hypothetical protein
DPSCF300551 | 22.5-30.3 | 0.060 | butterfly hypothetical protein
DPSCF300134 | 135.5-172.5 | 0.059 | insect hypothetical protein
DPSCF300001 | 4697.5-4709 | 0.053 |
DPSCF300190 | 213-236.5 | 0.052 | Collagen alpha-2(IV); thioredoxin family Trp26; Selenoprotein T; Tetratricopeptide-like helical
DPSCF300001 | 4622.5-4628.5 | 0.052 | kettin protein
DPSCF300134 | 109.5-134.5 | 0.052 | potassium ion transport protein
DPSCF300074 | 526.5-532 | 0.052 |
DPSCF300005 | 137-152 | 0.049 | WD40 transcription factor
DPSCF300255 | 244-257.8 | 0.048 |
DPSCF300014 | 176-202.5 | 0.047 | Golgin-80 (RabGTPase binding); Phosphatidylserine synthase; WD40 protein; kismet DNA binding protein
DPSCF300083 | 396-431 | 0.047 | Zinc finger DNA-binding domain
DPSCF300001 | 4725-4751 | 0.046 | forkhead protein transcription factor; Cyclic ion channel subunit
DPSCF300005 | 223.5-264 | 0.045 | Flotillin-1 (insulin-signaling pathway)
Clonal dynamics of native haematopoiesis


Jianlong Sun, Azucena Ramos, Brad Chapman, Jonathan B. Johnnidis, Linda Le, Yu-Jui Ho, Allon Klein, Oliver Hofmann
& Fernando D. Camargo


It is currently thought that life-long blood cell production is driven by the action of a small number of multipotent 
haematopoietic stem cells. Evidence supporting this view has been largely acquired through the use of functional assays 
involving transplantation. However, whether these mechanisms also govern native non-transplant haematopoiesis is 
entirely unclear. Here we have established a novel experimental model in mice where cells can be uniquely and genet- 
ically labelled in situ to address this question. Using this approach, we have performed longitudinal analyses of clonal 
dynamics in adult mice that reveal unprecedented features of native haematopoiesis. In contrast to what occurs follow- 
ing transplantation, steady-state blood production is maintained by the successive recruitment of thousands of clones, 
each with a minimal contribution to mature progeny. Our results demonstrate that a large number of long-lived pro- 
genitors, rather than classically defined haematopoietic stem cells, are the main drivers of steady-state haematopoiesis 
during most of adulthood. Our results also have implications for understanding the cellular origin of haematopoietic disease. 


Current dogma suggests that all haematolymphoid lineages are derived 
from a common ancestor, the haematopoietic stem cell (HSC). During
adult life, HSCs are thought to be the only bone marrow (BM) cell
population capable of long-term self-renewal and multilineage
differentiation. As HSCs divide, they produce multipotent and lineage-restricted
progenitor populations, which are regarded as transient intermediates
before the final production of functional blood cells. Historically, the
main experimental approach used to elucidate and define the cellular 
properties of various BM populations has been the transplantation assay. 
In this assay, prospectively purified cell populations are transplanted 
into myeloablated hosts. A general caveat to these approaches, however, 
is that only cells that are able to circulate, colonize a niche, and prolif- 
erate rapidly, will be able to produce detectable progeny. Additionally, 
given the extraordinary stress that transplanted cells endure during 
engraftment and the distorted cytokine milieu that they encounter, it is 
questionable to what extent their functional characteristics are shared 
with cells driving more physiological non-transplant haematopoiesis. 
Recent fate tracking approaches have proven to be fundamental in 
determining biological properties and clonal dynamics of solid tissue 
stem cells. Owing to the unique physical organization of the blood
system and the lack of HSC- or progenitor-restricted drivers, these ap- 
proaches have not been successfully applied to the study of native hae- 
matopoiesis. Because of this lack of tractable systems, the mechanistic 
nature of non-transplant haematopoiesis has remained largely unex- 
plored. Fundamental questions such as the number, lifespan and lin- 
eage potential of stem or progenitor cells that drive homeostatic blood 
production remain to be answered. Here, we describe a novel experimental
system to enable in situ labelling and clonal tracking of haematopoietic
cells, and use it to investigate the cellular origins, lineage
relationships and dynamics of native blood production. 


Clonal marking by transposon tagging 

Our experimental paradigm is based on the temporally restricted expression
of a hyperactive Sleeping Beauty (HSB) transposase, an enzyme that
mediates genomic mobilization of a cognate DNA transposon (Tn). In
our model, a doxycycline (Dox)-inducible HSB cassette and a single-copy
non-mutagenic Tn are incorporated in the mouse genome through gene
targeting (Fig. 1a). HSB expression is controlled by a Dox-dependent
transcriptional activator (M2), driven from the Rosa26 locus. In mice
carrying these three alleles (referred to as M2/HSB/Tn), Dox administration
results in HSB expression and subsequent Tn mobilization elsewhere
in the genome. As Tn integration is quasi-random, every cell
undergoing transposition will carry a single and distinct insertion site,
which, upon Dox withdrawal, will serve as a stable genetic tag for the
corresponding cell and its progeny (Fig. 1a). To monitor Tn transposition,
a DsRed reporter marks Tn mobilization by the concurrent removal
of an embedded transcription stop signal (Fig. 1a).

Tn mobilization could be induced in approximately 30% of the phenotypically
defined long-term (LT)-HSCs, short-term (ST)-HSCs, multipotent
progenitors (MPPs) and myeloid progenitors (MyP) following
3-4 weeks of induction, whereas no labelling was found in uninduced
mice (Fig. 1b). When transplanted, DsRed+ HSC/progenitors fully
reconstituted myeloid and lymphoid lineages for 10 months, indicating
labelling of bona fide LT-HSCs (Extended Data Fig. 1a-d). On the other
hand, transplantation of DsRed− HSCs/progenitors produced fully DsRed−
progeny, confirming extremely low levels of transposition in the absence
of Dox (Extended Data Fig. 1e, f). Analysis of uninduced older mice
revealed minimal levels of spontaneous Tn mobilization in peripheral
blood (PB) granulocytes (0.1%) and B cells (0.5%), two orders of magnitude
lower than transposition levels observed in Dox-treated animals
(Extended Data Fig. 1g). Peripheral T cells showed a higher degree of
background mobilization (4.1 ± 2.3%) (Extended Data Fig. 1g). Thus,
the M2/HSB/Tn model allows strict Dox-dependent Tn mobilization
in most of the haematopoietic compartment.

As predicted, haematopoietic colonies grown in Dox semi-solid medium
arising from sorted DsRed+ stem/progenitor cells carried single and
completely distinct insertion sites (Fig. 1c, Extended Data Fig. 2a, b, d).
Secondary colonies from LT-HSC clones inherited identical Tn tags as 
their corresponding primary colonies, indicating stable propagation of 
Tn tags among progeny (Extended Data Fig. 2c, d). Evidence of Tn ‘re- 
mobilization’ in the absence of Dox was only found in one of 24 second- 
ary colonies analysed. Furthermore, no re-mobilized tags were observed 
in 80 single cells from secondary replatings (Extended Data Fig. 2d). 
Figure 1 | Establishment of inducible transposon tagging approach.
a, Transgenic alleles and strategy used for inducible genetic tagging. M2-rtTA,
reverse tetracycline-responsive transcriptional activator; HSB, hyperactive
Sleeping Beauty transposase; Tn, HSB transposon; STOP, polyadenylation
signal; CAGGS, chicken β-actin promoter; TetO, tetracycline-response
element. b, Frequency of DsRed+ cells in long-term HSC (LT-HSC),
short-term HSC (ST-HSC), multipotent progenitor (MPP), and myeloid
progenitors (MyP) in marrow of M2/HSB/Tn mice exposed to Dox for 3 weeks.
Shown are representative FACS plots from three independently analysed mice
of similar age and induction period. c, Sequence of Tn tags identified from
20 DsRed+ LSK colonies that emerged following methylcellulose culture.
gDNA, genomic DNA.


We also established an improved PCR-based method to detect Tn
tags in polyclonal samples with minimal cell number requirements. This
combined whole-genome amplification (WGA) technology, three-arm
ligation-mediated PCR (LM-PCR) and next generation sequencing
(Extended Data Fig. 3, Additional Methods). Our method was sensitive
enough to reliably detect clones with a frequency as low as 5-25 out of
10,000 cells in a polyclonal population (Extended Data Fig. 4,
Supplementary Information).


Clonal dynamics of native haematopoiesis 


Armed with a strategy for clonal and genetic labelling in situ, we began 
to examine the long-term clonal behaviours of HSC and progenitor 
clones by Tn tag interrogation in sorted granulocytes, B cells and T cells 
from PB samples that were periodically collected over a period up to 
12 months after Dox withdrawal (Fig. 2a, Extended Data Fig. 5, Sup- 
plementary Table 1). Given the ubiquitous expression of the Rosa26- 
M2 driver (Fig. 1), both primitive and differentiated haematopoietic 
cells can undergo transposition. Although this provides an unbiased 
approach to label the stem/progenitor pool, we allowed 3-4 months of 
‘chase’ before sample collection so that Tn tags in mature PB populations
would be more likely derived from longer-lived HSCs, as predicted
from transplantation studies (Fig. 2a).

Our initial analysis focused on the dynamics of granulocyte production
given their rapid turnover rate. Among three independently
analysed mice, a range of 65-905 clones per time point was routinely
detected in sorted DsRed+ granulocytes (Supplementary Table 2).
Surprisingly, when analysed longitudinally, the vast majority of granulocyte
tags (90-98%) were detected at single time points (Fig. 2b, c, Extended 
Data Fig. 6a, b, d, e). Moreover, the recurrent tags (found at more than 
one time point) clustered in adjacent time points (Fig. 2b, Extended 
Data Fig. 6a, d). In contrast, highly stable clones were readily detected 
in B and T cell samples (Extended Data Fig. 7a). Considering the sen- 
sitivity of our method (Extended Data Fig. 4c), these data argue against 
the existence of stable granulocytic clones producing more than 0.05- 
0.25% of the PB granulocyte pool during the chase period. This pre- 
dominantly transient and highly polyclonal contribution persisted up 
to 12 months of chase, suggesting that this pattern does not represent a 
transitory stage of clonal fluctuation. Clonal instability was also
confirmed by tag-specific nested PCR (Extended Data Fig. 7b).
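
The distinction drawn above between tags seen at a single time point and recurrent tags can be illustrated with a minimal sketch. The function name, the tag identifiers and the time points below are hypothetical and serve only to show the bookkeeping; this is not the analysis pipeline used in the study.

```python
from collections import Counter


def classify_tags(tags_by_timepoint):
    """Split tags into those seen at one time point and those seen at several.

    tags_by_timepoint maps a time point (weeks of chase) to the tags found
    in that sample. Duplicates within a sample are counted once.
    """
    counts = Counter()
    for tags in tags_by_timepoint.values():
        counts.update(set(tags))
    unique = {tag for tag, n in counts.items() if n == 1}
    recurrent = {tag for tag, n in counts.items() if n > 1}
    return unique, recurrent


# Hypothetical insertion-site tags, not data from the paper.
samples = {
    13: ["chr1:100", "chr2:200", "chr3:300"],
    18: ["chr3:300", "chr4:400"],
    24: ["chr5:500"],
}
unique, recurrent = classify_tags(samples)
fraction_unique = len(unique) / (len(unique) + len(recurrent))
```

In this toy input only one tag recurs, and it does so at adjacent time points, which mirrors the qualitative pattern described in the text.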

To examine whether limited PB sampling might underlie the observed 
lack of clonal stability, we asked whether ‘unstable’ PB clones could be 
detected in a much larger terminal sample comprising approximately 
80% of BM. This analysis revealed a clear inverse correlation between
the number of PB clones found in the BM and the time elapsed since 
PB collection, a pattern highly indicative of limited lifespan (Fig. 2b, e, 
Extended Data Fig. 6g, h). Indeed, the fraction of persistent clones dropped 
exponentially with time, from which we could calculate that active gra- 
nulocytic clones had a detectable half-life of 3.3 weeks in PB (Fig. 2e, 
Extended Data Fig. 6h). A very minor subset of transient PB clones did 
reappear in the BM sample (Fig. 2b, Extended Data Fig. 6g). It is unclear 
whether this represents stochastic detection of minor stable clones or 
whether this reflects clonal re-activation. 
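
As a rough illustration of how a half-life follows from an exponential decline, the sketch below fits a straight line to the logarithm of the persistent fraction and converts the decay rate to a half-life (ln 2 divided by the rate). The log-linear least-squares fit is an assumption made here for simplicity, and the input values are synthetic, generated from the 3.3-week half-life quoted above rather than taken from the measurements.

```python
import math


def fit_half_life(weeks, fraction_detected):
    """Estimate a half-life from fractions that decay exponentially with time.

    Fits ln(fraction) = a - k * t by ordinary least squares and
    returns ln(2) / k.
    """
    logs = [math.log(f) for f in fraction_detected]
    n = len(weeks)
    mean_t = sum(weeks) / n
    mean_y = sum(logs) / n
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(weeks, logs))
    sxx = sum((t - mean_t) ** 2 for t in weeks)
    decay_rate = -sxy / sxx
    return math.log(2) / decay_rate


# Synthetic fractions built from an assumed 3.3-week half-life
# (hypothetical sampling times, not measured values).
weeks_since_sampling = [1, 7, 11, 16, 22, 27]
synthetic_fraction = [0.5 ** (t / 3.3) for t in weeks_since_sampling]
half_life = fit_half_life(weeks_since_sampling, synthetic_fraction)
```

With noisy real data, a weighted or nonlinear fit may be preferable, since taking logarithms inflates the influence of small fractions.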

The observed pattern of clonal dynamics did not result from an arti- 
ficial increase in clonal complexity due to the 3-4-week induction per- 
iod, as similar clonal dynamics were observed in mice induced for one 
day (Extended Data Fig. 7c). Additionally, background Tn remobilization
does not significantly contribute to our observations, as only approximately
seven Tn tags were detected in PB granulocytes of uninduced
mice, compared to the several hundred clones found in Dox-treated 
animals (Extended Data Fig. 7d). Collectively, these data imply that long- 
term steady-state granulopoiesis is vastly polyclonal and largely driven 
by the successive recruitment of non-overlapping clones. 


Clonal diversity and lifespan 


The LM-PCR method currently applied is not quantitative, and is likely 
to underestimate the full clonal repertoire (Extended Data Fig. 4g,
Supplementary Information). To obtain a more representative view of 
clone size distribution and number, we performed single-cell LM-PCR 
analyses on sorted PB granulocytes (Fig. 3a). Among the total 290 single 
granulocytes analysed from an induced mouse at three consecutive time 
points, we detected 270 unique Tn tags. 254 of them were present in 
single granulocytes, 14 were observed twice and only 2 tags were found 
in three single cells (Fig. 3b). None of the tags was present in all three 
time points analysed (Fig. 3b). Single-cell analysis of another induced 
mouse at later time points revealed similar results (Fig. 3c). These find- 
ings confirm the extreme polyclonal nature of steady-state granulopoi- 
esis and provide support for the paucity of dominant or stable clones. 

Based on these single-cell data, we re-evaluated the number of clones 
present in PB granulocytes using statistical models of random sam- 
pling (see Methods), with the assumption that granulocyte clones are 
of uniform size. All time points provided very similar estimates for total 
clone number: 831 ± 206 (mean ± s.e.m.) (Fig. 3d, e). Considering that
this analysis is restricted to only the approximately 30% DsRed+ labelled
cellular fraction (Fig. 1b), our estimate represents only a fraction of the
clones that maintain granulopoiesis in a mouse at any given time. Addi- 
tionally, if we take into account that, at least monthly (our sampling 
interval), new clones are periodically recruited, our findings reveal an 
extraordinary amount of clonal complexity that is used to sustain long- 
term granulocyte production. 
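
The idea of inferring total clone number from a small single-cell sample can be sketched as follows, under the same uniform-clone-size assumption stated above. This is a simple method-of-moments estimate, not the statistical model described in the paper's Methods: it finds the clone number N at which the expected number of distinct tags among n sampled cells, N(1 - (1 - 1/N)^n), matches the number observed. The function names and the example counts are hypothetical.

```python
def expected_distinct(n_clones, n_cells):
    """Expected distinct tags when n_cells are drawn uniformly from n_clones."""
    return n_clones * (1.0 - (1.0 - 1.0 / n_clones) ** n_cells)


def estimate_clone_number(n_cells, n_distinct, upper=1e9):
    """Solve expected_distinct(N, n_cells) == n_distinct for N by bisection.

    Requires 1 <= n_distinct < n_cells: if every sampled cell carries a
    different tag, the sample gives no finite upper bound on N.
    """
    if not 1 <= n_distinct < n_cells:
        raise ValueError("need 1 <= n_distinct < n_cells")
    lo, hi = float(n_distinct), float(upper)
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        # expected_distinct increases with N, so move the bracket accordingly.
        if expected_distinct(mid, n_cells) < n_distinct:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical sample: 100 single cells yielding 94 distinct tags.
estimate = estimate_clone_number(n_cells=100, n_distinct=94)
```

Because few repeats are observed when clones are numerous, such estimates are sensitive to small changes in the count of repeated tags, which is consistent with the wide uncertainty reported in the text.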
Figure 2 | Clonal dynamics of native haematopoiesis. a, Experimental flow
chart showing longitudinal clonal analysis on FACS-sorted PB granulocytes
(Gr), B cells, T cells, and BM Gr from induced mouse LL106. Tn tags are
determined with the analysis pipeline described in Supplementary Methods.
b, Distribution of Tn tags identified in PB Gr samples across multiple time
points, lineages, and in BM. Each horizontal line represents a unique tag.
Clones present exclusively in B cells, T cells or BM Gr are not shown. Bottom
panel shows subset of PB Gr tags found in multiple time points. c, Analysis
showing the number of Gr tags that are either unique or recurrent in the Gr
lineage. d, Analysis of the number of Gr tags that are either Gr-restricted or
shared among B/T lineages. e, Extent of clonal overlap between PB Gr tags at
different time points post chase and terminal BM Gr sample. Dashed line is an
exponential fit to the data.


Lineage output of haematopoietic clones 


We next compared Tn tags of granulocytes, B and T lymphocytes to 
determine lineage potential of the granulocyte-producing clones. Remarkably,
very few of the granulocyte tags were shared with either B or T
cells in the PB (Fig. 2d, Extended Data Fig. 6c, f). This lack of common 
clonal origin was also observed when BM granulocytes and nascent pro/ 
pre-B cells were compared at multiple time points of chase (Extended 
Data Fig. 8a, b, Supplementary Tables 1 and 3), where only around 7% of 
BM granulocytes had the same clonal origins as nascent pro/pre-B cells 
(Extended Data Fig. 8c). Therefore, the bulk of granulocyte-producing 
clones are myeloid-restricted for up to 45 weeks. 
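
Lineage sharing of this kind reduces to a set comparison between the tags recovered from two populations. The sketch below, with a hypothetical function name and made-up tag identifiers, shows one way to express the shared fraction; note that the result depends on which population is used as the denominator.

```python
def shared_fraction(query_tags, reference_tags):
    """Fraction of distinct query tags that also occur in the reference set."""
    query = set(query_tags)
    if not query:
        raise ValueError("query_tags must not be empty")
    return len(query & set(reference_tags)) / len(query)


# Hypothetical tag identifiers, not data from the paper.
granulocyte_tags = {f"tag{i}" for i in range(1, 11)}   # tag1 .. tag10
b_cell_tags = {"tag10", "tag11", "tag12"}

gr_shared_with_b = shared_fraction(granulocyte_tags, b_cell_tags)
b_shared_with_gr = shared_fraction(b_cell_tags, granulocyte_tags)
```

The asymmetry matters when comparing statements such as the share of granulocyte tags found in B cells with the share of pro/pre-B tags found in granulocytes, since the two use different denominators.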

We also sought to determine the lineage potential of lymphoid clones.
While only ~10% of the pro/pre-B tags are found in granulocytes at 9
and 26 weeks, a much larger portion (~47%) is present in myeloid cells
at 40-45 weeks post-induction (Extended Data Fig. 8d). These data
suggest that B-cell production shifts from a predominantly lymphoid-
restricted progenitor to a multipotent progenitor after six months of chase.
In contrast, monocytes, a myeloid cell type traditionally thought to share
the same clonal origins as granulocytes, had approximately 60% of
their tags shared with granulocytes at all three time points, which
confirms the close relationship between these two lineages and suggests
that myeloid-producing clones are at least bi-potent (Extended Data
Fig. 8d).


Figure 3 | Polyclonal and fluctuating nature of native granulopoiesis.
a, Experimental flow chart for the detection of Tn tags in single PB
granulocytes. b, c, Single-cell-derived Tn tags from mouse AR1120 (b) and
AR468 (c) at multiple time points of chase. Numbers in each box represent
unique Tn IDs detected in single cells. Colour-coded boxes depict cells with
recurrent tags. Red font depicts tags found at more than one time point. The
analysis was performed on two induced mice and results of both are presented
here. d, Probability distribution of the total number of clones in PB Gr of
AR1120 at different time points (colour curves). Black curve shows the
normalized product of the probabilities from all time points. e, Predicted clone
number with 95% confidence interval (CI) in PB Gr of mouse AR1120 using the
data from b and d (see Methods).


Features of transplant haematopoiesis 


Our findings here starkly contrast with the clonal behaviour previously 
reported using retroviral barcoding techniques. In such experiments, a few 
dominant LT-HSC clones stably output multiple blood lineages.
These observations could be recapitulated in our model as a handful of
stable and multipotent clones were observed in recipients of retrovirus-
infected DsRed+ Lin− c-Kit+ Sca1+ cells (Extended Data Fig. 9a, b,
Supplementary Tables 4 and 5). Similar observations were obtained with
transplantation of freshly isolated DsRed+ Lin− c-Kit+ or LT-HSCs,
although the clonal diversity was significantly increased, probably due
to higher regenerative potential of less-manipulated cells (Extended Data
Fig. 9e-h, k-m). Single-cell analysis of PB granulocytes of recipients
confirmed the presence of dominant and stable clones (Extended Data
Fig. 9c, d, i, j). Thus, our methodology is reliable enough to reveal stable
and multipotent clonal behaviours. Our findings, therefore, demon- 
strate inherent and fundamental differences in the clonal dynamics of 
post-transplant and steady-state haematopoiesis. 


Cellular origins of haematopoietic clones 


Historically, LT-HSCs have been considered the major source of long- 
term haematopoiesis, although evidence for this in a non-transplant 
setting is limited. We therefore directly examined the extent of LT-HSC
contribution during native blood production by two different approaches.
First, we compared the clonal repertoire of resident BM granulocytes
in an M2/HSB/Tn mouse more than a year after Dox-induction
with that of granulocytes and B cells derived after transplantation of such 
BM (Fig. 4a, Supplementary Tables 4 and 5). If classical transplantable 
HSCs drive steady-state granulopoiesis in the donor mouse, then the 
same tags would be recovered in the progeny of engrafted recipients. 
Only 5-8% of donor granulocyte tags were present in granulocytes or B 
cells in recipient mice, and almost all of these tags displayed transient 
engraftment (Fig. 4b, c). Two donor clones were detected in BM gran- 
ulocytes 73 weeks after transplant, but these were not detected in the 
LT-HSC and progenitor compartments in recipient mice (Fig. 4c). In 
contrast, many of the stable PB clones arising shortly after transplanta- 
tion were still actively producing multilineage progeny in BM one year 
later, and a subset of them clearly originated from LT-HSCs (Fig. 4b, c). 
This suggests that granulocyte production in situ for at least a year is 
not predominantly driven by BM cells with the capacity to engraft, but 
instead by progenitors with limited transplantation capacity. 

To further examine the ancestral relationships during native blood 
production, we determined clonal compositions of fluorescence-activated 
cell sorting (FACS)-purified LT-HSCs, MPPs and MyPs, and compared 
them with granulocytes, pro/pre-B cells, and monocytes from the same 
BM (Fig. 5a, Extended Data Fig. 10a). While approximately half of clones 
found in MyPs and MPPs were shared with mature populations, sur- 
prisingly, less than 5% of LT-HSC tags were also present in these mature 
cell types (Fig. 5b, c, Extended Data Fig. 10b, c). The extent of LT-HSC 
output does not increase if tags are compared to longitudinal PB gran- 
ulocyte and B-cell samples (Fig. 5c, d, Extended Data Fig. 10c). Remark- 
ably, we also found that less than 5% of LT-HSCs shared tags with MPPs 
and MyPs, traditionally considered their immediate downstream pro- 
geny (Fig. 5b, Extended Data Fig. 10b). These observations differ signi- 
ficantly from what occurs following transplantation, where many of the 
stable and dominant PB clones originated from LT-HSCs (Fig. 4b, c, 
Extended Data Fig. 9j). Taken together, these observations show that
LT-HSCs have limited lineage output under unperturbed conditions
for at least 40 weeks, and that progenitors play a central role during native
myelo- and lymphopoiesis (Fig. 5b, e, Extended Data Fig. 10b, d).


Figure 4 | Non-engraftable progenitors drive native haematopoiesis.
a, Experimental flow chart used to compare clonal origins of native and
recipient haematopoiesis. b, c, Tn tag analysis of cell populations from donor
BM, recipient PB and recipient BM samples. Only the clones identified in
granulocyte populations from donor BM and recipient PB are shown. Note
stable, multilineage and HSC-derived haematopoiesis in recipients from clones
not present in donor granulocytes. Two recipient mice were analysed: recipient
1 (LL109) received femur BM (b); recipient 2 (LL113) received tibia BM (c).

The detection of clonal overlap between MPPs and mature cell types 
allowed us to preliminarily interrogate lineage potential of MPPs at a 
clonal level. Our data provide definitive evidence for the existence of 
multipotent MPP clones (Fig. 5b, e, Extended Data Fig. 10b, d). However, 
in contrast to the transplantation model, the majority of MPPs
contribute predominantly to the myeloid lineage.


Discussion 


We present here multiple lines of evidence demonstrating that, in an 
unperturbed system, classical LT-HSCs have a limited contribution to 
blood production during most of adulthood. This is surprising, con- 
sidering that during the period encompassed by our studies (~1 year) 
multiple LT-HSC divisions would have occurred. While our data
cannot fully rule out potential stable contribution by LT-HSCs, this is
likely to be lower than our detection limit and relatively minor in
comparison to that of MPPs. The absence of LT-HSC clones in other 
populations could alternatively be explained by a clonal ‘successive 
deletion’ model, in which HSCs would undergo symmetric differentia- 
tion cell divisions. While we cannot fully rule this out, we consider that 
this model is not sufficient to explain the source of extreme clonal com- 
plexity observed. 

Our results argue for a model where successive recruitment of thou- 
sands of both lineage-restricted and multipotent clones drives steady- 
state haematopoiesis for at least a year (Fig. 5f). In this model, a large 
number of progenitors are specified by early postnatal life (before the 
time of Dox labelling), after which there is limited contribution to this 
pool by LT-HSCs. These progenitors are likely to encompass cells tradi- 
tionally defined as ST-HSCs, MPPs and other populations with transient 
reconstituting activities, and their abundance (for example, >100,000
MPPs and >500,000 MyPs) could support the breadth of clonal diversity
observed. Stochastically, a fraction of these clones can get recruited
for blood production, where they undergo commitment and a massive 
proliferative burst to produce detectable PB progeny. Our findings of
successive and polyclonal long-term behaviour are supported by
irradiation marking experiments and by more recent studies involving
in vivo lentiviral tagging. Similarly, variance analyses have predicted
that haematopoiesis is maintained by a large number of haematopoietic
clones.


16 OCTOBER 2014 | VOL 514 | NATURE | 325


©2014 Macmillan Publishers Limited. All rights reserved


[Figure 5 panel labels: a, BM extraction (forelimbs, hindlimbs, spine, ribs, sternum) from M2/HSB/Tn mouse LL106, 40 weeks off Dox, followed by FACS of LT-HSC, MPP, MyP, granulocytes (Gr), monocytes (Mo) and pro/pre-B cells (B); c, PB granulocytes and B cells at chase weeks 13, 18, 24, 29, 33, 39 and 40, and BM at 40 weeks; e, MPP subtypes (lymphoid, myeloid, bipotent); f, transplantation versus native haematopoiesis (LT-HSC, progenitor, mature cells).]


Figure 5 | LT-HSCs make a limited contribution to native haematopoiesis.
a, Schematic for clonal analysis of BM populations. b, Distribution of identified
Tn tags in LT-HSCs, MPPs, MyPs, granulocytes, monocytes, and pro/pre-B
cells. Tags present in Gr, Mo, or B but not detected in any of the progenitor
populations are not shown. MPP-derived clones are displayed on the right.
c, Tn tags of ‘active’ LT-HSCs clones and their presence in downstream
progenitors and mature cell types in BM and longitudinal PB samples. Clones
are considered active if they share their Tn tags with at least one of the

One intriguing question that arises from our studies is whether clonal 
diversity or lifespan of progenitors will eventually exhaust in severely 
aged mice. Additionally, it will be important to perform follow-up clonal 
dynamic studies in the context of stress haematopoiesis. These studies 
will determine under what circumstances classically defined LT-HSCs 
engage in blood production in situ and which biological contexts deter- 
mine progenitor lifespan. It will also be important to determine the exact 
developmental and cellular origins of the observed long-lived progenitor 
clones. Our model will also be helpful in re-assessing classical haema- 
topoietic lineage hierarchies under more physiological conditions. 

Our data provide insight into the potential nature of the cell-of-origin 
of myeloid malignancies. It is currently thought that HSCs, given their 
known lifelong persistence, are ideal candidates as the target cells for 
oncogenic transformation. In light of our data, the much larger number
of long-lived progenitors may provide a more accessible pool of cells
where oncogenic mutations may arise. Our transposon tagging approach 
could similarly be used to evaluate clonal dynamics and evolution in 
primary tumours. The modular nature of our system should enable 
cell-type-specific transposition, allowing clonal fate tracking of defined 
cell populations. Our work paves the way for future systematic and high- 
resolution analysis of clonal dynamics during development, ageing and 
multiple other biological processes. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Mice. The expression cassette of a hyperactive Sleeping Beauty (HSB) gene, and
the HSB-responsive transposon element (Tn) were subcloned in the Col1a1 locus
using FLP-mediated recombination, as previously described. A DsRed reporter
gene, normally suppressed by the transcription polyadenylation signal between
the inverted repeats of the Tn, was cloned downstream of the Tn element. Targeted
embryonic stem cell clones were generated in KH2 lines and chimaeric mice were
produced following a published protocol. The HSB and Tn mice were intercrossed
to create the compound transgenic M2/HSB/Tn mouse model. The resulting mice
are of a mixed genetic background (C57BL/6J and 129/SvJ). 8-10-week-old male
or female mice with the M2/HSB/Tn genotype were used in this study. To induce
Tn mobilization, mice were fed with 1 mg ml⁻¹ Dox together with 5 mg ml⁻¹
sucrose in drinking water until the desired level of labelling was achieved.
3-4 capillaries of PB, which encompassed around 10% of the total blood of adult
mice, were collected from the retro-orbital sinus every 4-6 weeks. BM cells were
flushed out with 2% fetal bovine serum (FBS) in phosphate buffered saline (PBS)
from dissected bones. CD45.1+ mice were used as transplantation recipients
(B6.SJL-Ptprca Pep3b/BoyJ, stock # 002014, the Jackson Laboratory). All animal
procedures were approved by the Boston Children’s Hospital Institutional Animal
Care and Use Committee.
Fluorescence-activated cell sorting (FACS). Cell populations from PB and BM
were purified through FACS on FACSAria (BD Biosciences). The following
combinations of cell surface markers were used to define these cell populations: PB Gr,
Ly6G+ CD4− CD8− CD19−; B cells, CD4− CD8− CD19+; T cells, CD4+ CD8+ CD19−;
BM Gr, Ly6G+ 7/4+ B220−; monocytes, Ly6G− 7/4+ B220−; pro/pre-B cells, 7/4−
IgM− B220+; LT-HSC, Lin− cKit+ Sca1+ CD48− CD150+; MPP, Lin− cKit+ Sca1+
CD48+ CD150−; myeloid progenitors, Lin− IL7Rα− cKit+ Sca1−. Lineage markers
were composed of CD4, CD8, CD19, Mac1, Gr1, and Ter119. For MACS depletion,
BM cells were first stained with biotin-conjugated lineage markers CD3e,
CD19, Mac1, and Ter119. Lin− and Lin+ cell populations were then separated with
autoMACS Pro separator (Miltenyi Biotec) with the manufacturer’s depletion protocol.
Commercially available antibodies are listed in Supplementary Table 6. Flow
cytometry data were analysed with FlowJo (Tree Star).

Methylcellulose colony formation assays. Tn-marked HSPCs or LT-HSCs were 
sorted from BM of induced M2/HSB/Tn mice as DsRed⁺Lin⁻cKit⁺Sca1⁺ or
DsRed⁺Lin⁻cKit⁺Sca1⁺CD48⁻CD150⁺ cells, respectively. Cells were cultured
at clonal density in methylcellulose (Methylcellulose Base Medium, R&D Technologies)
supplemented with 10 ng ml⁻¹ recombinant murine G-CSF, 10 ng ml⁻¹
SCF and 10 ng ml⁻¹ Tpo. Single colonies were picked for Tn insertion tag analyses 12
days after plating. 

Transplantation assays. Either fractionated or whole BM cells (CD45.2⁺) from
induced M2/HSB/Tn mice were transplanted through retro-orbital injection with 
or without 1 X 10° whole BM cells (CD45.1*) into lethally irradiated C57BL/6 re- 
cipient mice (11.6 Gy of gamma-irradiation in a split dose with 2 h interval). Haema- 
topoietic stem and progenitor cells were transduced with retrovirus (pMIG, Addgene 
#9044) at a multiplicity of infection of 1 in vitro for 24 h before transplantation. The
retrovirus was produced by transient transfection of the pMIG vector to the Phoenix- 
AMPHO packaging cell line (ATCC). Donor cell engraftment was determined at 
multiple time points following transplantation by PB flow cytometry analysis on 
LSR II (BD Biosciences). 

Whole-genome amplification (WGA). Cells of interest were sorted into 1.7 ml 
tubes and concentrated into 5–10 µl of buffer by low-speed centrifugation. For
each sample, all the sorted cells were used for whole-genome amplification with the
REPLI-g Mini kit (150025, Qiagen) according to the manufacturer’s instructions. Amplified
DNA was further purified with the QIAamp DNA Micro kit (56304, Qiagen), and half of
the elution was used for downstream analysis. 

3-Arm LM-PCR and sequencing. To increase the coverage of Tn insertion tags, 
300 ng of purified DNA was digested with three restriction enzymes (DpnII, HaeIII,
MspI), and then ligated with the corresponding DNA linkers. Ligation mixtures
were pooled and further digested with XbaI and KpnI to prevent detection of Tn
remaining at the donor site. Digested products were cleaned with the MinElute Reaction
Cleanup kit (28204, Qiagen), and the entire eluate was used in primary PCR reactions
with primers specific to Tn and linker sequences. The Tn-specific primer was
biotinylated at the 5′ end, which allowed enrichment of the PCR products using the
Dynabeads kilobaseBINDER kit (601-01, Invitrogen). PCR products were retrieved
by incubation in 5 µl of 0.1 M NaOH for 10–20 min, and 2 µl of this was further amplified
with nested primers in secondary PCR. The nested PCR primers contained
adaptor sequences, with which the sequencing library was constructed directly from
purified secondary PCR products. Solexa sequencing was carried out on a HiSeq 2000
(Illumina) at the Tufts Genomics Core. Sequences of PCR primers are listed in
Supplementary Table 7. Raw and processed sequencing data will be available upon
request. 

Identification and comparison of Tn insertion tags. The analysis script was 
developed in-house (Supplementary Information). NGS data were first filtered to 
retain reads containing Tn sequence followed by the characteristic TA dinucleotide 
sequence present at the Tn-genomic DNA (gDNA) junction. Linker sequence, if 
present, was trimmed along with the Tn sequence to obtain gDNA sequence for 
alignment against the mouse genome (NCBI37/mm9) with the BLAT algorithm. A 
positive alignment required a minimum match of 17 nucleotides with no mismatches
allowed. To focus on unique insertion sites, non-mapped Tn tags and tags with
multiple mapping sites were excluded from downstream analysis. To uniquely com- 
pare Tn insertion tags across multiple samples, we developed software that merges 
insertions (within 25 base pairs) from multiple experiments, normalizes by total 
read counts and filters low-frequency tags according to criteria described in Sup- 
plementary Information. 
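
The filtering and merging steps described above can be sketched roughly as follows. This is a minimal illustration, not the in-house script: the transposon-end sequence `TN_END`, the function names, the choice to treat the junction TA as the first two bases of the genomic sequence, the single-linkage merging rule and the reads-per-million scaling are all assumptions made for the example. Alignment with BLAT and the low-frequency filter are outside the sketch.

```python
# Hypothetical transposon-end sequence; the real Tn terminal sequence is not given in the text.
TN_END = "GTATGTAAACTTCCGACTTCAACTG"
MIN_GDNA_LEN = 17  # minimum perfect-match length required for a positive alignment
MERGE_WINDOW = 25  # insertions within 25 bp are merged


def extract_gdna(read, tn_end=TN_END, linker=None):
    """Return the genomic part of a read, or None if the read is rejected.

    Assumes the junction TA forms the first two bases of the genomic sequence.
    """
    pos = read.find(tn_end)
    if pos < 0:
        return None  # no transposon sequence in the read
    gdna = read[pos + len(tn_end):]
    if not gdna.startswith("TA"):
        return None  # characteristic junction dinucleotide missing
    if linker and linker in gdna:
        gdna = gdna[:gdna.index(linker)]  # trim the linker and anything after it
    return gdna if len(gdna) >= MIN_GDNA_LEN else None


def merge_insertions(sites, window=MERGE_WINDOW):
    """Merge mapped sites on the same chromosome and strand lying within `window` bp.

    `sites` is an iterable of (chrom, strand, position, reads) tuples pooled
    from several samples. Returns (chrom, strand, start, end, reads) clusters.
    """
    clusters = []
    for chrom, strand, pos, reads in sorted(sites):
        last = clusters[-1] if clusters else None
        if last and last[0] == chrom and last[1] == strand and pos - last[3] <= window:
            last[3] = pos      # extend the cluster to the new position
            last[4] += reads   # pool the read counts
        else:
            clusters.append([chrom, strand, pos, pos, reads])
    return [tuple(c) for c in clusters]


def normalize(clusters, scale=1_000_000):
    """Express each cluster's reads per `scale` total reads (an assumed convention)."""
    total = sum(c[4] for c in clusters)
    return {(c[0], c[1], c[2]): c[4] * scale / total for c in clusters}
```

In use, each read would pass through `extract_gdna`, the surviving sequences would be aligned externally, and the uniquely mapped positions would then be fed to `merge_insertions` and `normalize` before any frequency filter is applied.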

Single-cell Tn insertion tag analysis. DsRed⁺ granulocytes were sorted from
blood as described above, from which single cells were sorted into 96-well PCR
plates with 2 µl PBS in each well. WGA was carried out directly from these single
cells. Amplified DNA was digested, heat-inactivated, and ligated to the correspond- 
ing linker. Nested PCR was performed on the ligation product, and PCR products 
were analysed with conventional cloning and sequencing methods. 
Insertion-specific PCR. Nested PCR primers were designed based on genomic 
DNA sequences surrounding Tn insertion tags as identified in high-throughput 
sequencing. Singleplex PCR reactions were carried out for the individual clones 
by using insertion-specific primers along with one of the transposon primers. 
Establishment of HEK293 clones with stable Sleeping Beauty transposon 
insertion sites. HEK293 cells were obtained from R. Gregory (Boston Children’s 
Hospital). The cells were transfected with the transposon-targeting vector. Stable 
clones were selected with neomycin for two weeks. The copy number of each
stable clone was determined by quantitative PCR of the NeoR gene embedded
in the transposon vector. An HEK293 clone with a single copy of stably integrated
transposon vector was selected and further transfected with an HSB-expressing vector
to induce Tn mobilization. To terminate transposition, we propagated the transfected
cells three times, during which the HSB-expressing vectors were gradually lost. The
DsRed⁺ HEK293 cells that had undergone Tn transposition were enriched by
FACS and grown at clonal density. Ten DsRed⁺ colonies were picked, and LM-PCR
and Sanger sequencing were used to determine Tn insertion tags. To assemble polyclonal
samples, cell sorting was used to mix the same number of cells from each
clone. Duplicate admixtures were prepared at six cell dosages: 1, 5, 25, 100, 500 and
2,500 cells. 10,000 PB cells from an induced M2/HSB/Tn mouse were added to
each sample to further increase the clonal complexity. The resulting polyclonal
samples were then processed in the same manner as blood samples for Tn insertion 
tag analysis. 
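
To give a sense of how rare each HEK293 clone is in these admixtures, the short sketch below computes the fraction of input cells contributed by a single clone at each dosage. It assumes that all ten clones are present at the stated dosage and that the 10,000 PB cells are the only other cells in the sample; the function and constant names are illustrative.

```python
DOSAGES = [1, 5, 25, 100, 500, 2500]  # cells per HEK293 clone, from the text
N_CLONES = 10                         # DsRed+ colonies picked (assumed all mixed in)
PB_CELLS = 10_000                     # PB cells added to each sample


def clone_fraction(dose, n_clones=N_CLONES, carrier=PB_CELLS):
    """Fraction of all cells in one admixture contributed by a single clone."""
    return dose / (n_clones * dose + carrier)


if __name__ == "__main__":
    for dose in DOSAGES:
        print(f"{dose:>5} cells per clone: {clone_fraction(dose):.4%} of input cells")
```

Under these assumptions, the lowest dosage corresponds to one cell in 10,010, which is the scale at which the detection sensitivity of the assay is being tested.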


33. Beard, C., Hochedlinger, K., Plath, K., Wutz, A. & Jaenisch, R. Efficient method to
generate single-copy transgenic mice by site-specific integration in embryonic
stem cells. Genesis 44, 23–28 (2006).

Extended Data Figure 1 | Characterization of M2/HSB/Tn mouse model.
a, Experimental flow chart showing transplantation of DsRed⁺Lin⁻cKit⁺ BM
cells from induced M2/HSB/Tn mice (CD45.2⁺) into lethally-irradiated
recipient mouse (CD45.1⁺). b, Longitudinal follow-up of donor-derived PB
cells in 5 recipient mice. c, Representative dot plots showing percentage of
donor-derived (CD45.2⁺) granulocytes, B cells and T cells 42.5 weeks after
transplantation. d, Longitudinal follow-ups of DsRed expression in
donor-derived PB granulocytes, B cells, and T cells. e, Experimental flow chart
showing transplantation of DsRed⁺Lin⁻cKit⁺ or DsRed⁻Lin⁻cKit⁺ BM cells.
f, Longitudinal follow-ups of DsRed expression in donor-derived PB cells. 3 and
4 mice received DsRed⁻ and DsRed⁺ donor cells, respectively. g, Fraction of
DsRed⁺ cells in PB granulocytes, B cells and T cells from 6–8-month-old
induced (n = 6) and uninduced (n = 4) M2/HSB/Tn mice. Mean ± s.d. is
shown.
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Extended Data Figure 2 | Stable propagation of Tn tags during in vitro 
expansion of LT-HSC clones. a, Experimental flow chart showing primary 
and secondary colony-formation assays and Tn tag analyses. b, Results of LM- 
PCR analysis on primary LT-HSC colonies. M, 100-bp DNA ladder. The two 
PCR products detected from colony no. 2 and 3 resulted from LM-PCR 
amplification of both ends of single Tn insertion sites. c, Results of LM-PCR 
analysis on secondary colonies from two of the primary colonies. Identities of
the PCR products in b and c were determined by cloning and Sanger
sequencing. Arrows indicate PCR products of Tn tags identified in parental 
colonies. Bands marked by white asterisks are PCR artefacts, which are defined 
by the absence of transposon element or uniquely aligned genomic DNA 
sequence. White arrowheads depict de novo Tn tags. d, Summary of Tn tags 
identified in primary colonies, secondary colonies, and single-cell analysis.

(1) Dox induction

(2) Periodic sampling of PB and cell sorting
Approximately 10–15% of total PB is sampled every 4–6 weeks. DsRed⁺ granulocytes,
B cells and T cells are obtained by FACS.

(3) Whole genome amplification
Samples from unrelated mice are processed side-by-side to control for the level
of cross-contamination.

(4) Enzyme digestion
Amplified DNA is digested with three different restriction enzymes.

(5) Ligation
The digested products are purified and ligated to the corresponding
linkers (L1, L2, L3).

(6) Secondary digestion
The donor site for transposition (DST) contains recognition sequences of the
restriction enzymes XbaI and KpnI at the upstream and downstream termini of the
transposon. To prevent detection of the DST, the ligation mixtures are pooled
and further digested with XbaI and KpnI. Most induced Tn insertion sites are not
expected to have recognition sequences for these two rare cutters in their
surrounding regions, and are therefore preserved in this secondary digestion step.

(7) First-round nested PCR
The Tn primer is labelled with biotin for enrichment in step 8.

(8) Enrichment
Amplified PCR products are enriched by streptavidin-coated magnetic beads. DNA
is recovered from the beads by incubating in 0.1 N NaOH for 5 minutes.

(9) Second-round nested PCR
The enriched PCR products are amplified with nested PCR primers, which contain
sequences of the Solexa sequencing primer and the Illumina index sequencing primer.

(10) Indexing PCR
Nested PCR products are further amplified with primers containing flow cell
attachment adaptors and 6-bp index sequences. Distinct indexes are used for the
individual samples.

(11) Sequencing
Indexed sequencing libraries are mixed at equal molar concentration and
subjected to high-throughput sequencing on Illumina’s HiSeq 2000.

(12) Tn tag identification and comparison

Extended Data Figure 3 | Flow chart showing experimental procedures of Tn tag labelling and detection.
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Extended Data Figure 4 | Characterization of methodology for Tn tag 
detection. a, A representative plot showing read frequencies of Tn tags 
detected in a test sample (shown on x axis), and their frequencies observed in 
control samples from an unrelated mouse (shown on y axis). Each circle 
represents a unique Tn tag. The dashed line depicts 50-read cutoff. Tags in the 
red box are high-confidence reads selected for further analysis. b, Detection 
sensitivity of linear amplification-mediated PCR (LAM-PCR) and ligation- 
mediated PCR (LM-PCR). Serial dilutions of genomic DNA from a transposon 
mouse are used as input. c, Sensitivity of Tn tag detection from polyclonal 
samples using LM-PCR. The polyclonal samples are assembled by mixing 
10,000 DsRed⁺ PB cells and different numbers of each of ten HEK293 clones.
The Tn tags in these HEK293 clones were pre-determined. Six cell dosages
(1, 5, 25, 100, 500 and 2,500 cells) are tested in duplicate for each clone. A
positive call for the detection of the known Tn tags is determined based on
criteria defined in Supplementary Information. d, Read frequencies between
the duplicate samples in c are positively correlated. Each circle depicts a Tn tag
from one of the seven HEK293 clones at a particular cell dosage. e, Venn
diagram showing additional technical LM-PCR repeats performed on PB Gr 
split samples of mouse AR1122 collected at 12, 18 and 23 weeks after Dox 
withdrawal. Shown in plots are the number of Tn tags that are either commonly 
or uniquely detected in each of the repeats. f, Plots showing read frequencies of 
Tn tags described in e. g, Broad distribution of read frequencies among different 
HEK293 clones with same input cell numbers. Averages of the duplicate 
samples are shown.

Extended Data Figure 5 | Purification of PB granulocytes, B cells, and T
cells by FACS. a, Schematic for FACS purification and purity analysis of
DsRed⁺ PB granulocytes, B cells, and T cells from induced M2/HSB/Tn mice.
b, DsRed⁺ gates are established based on PB samples from uninduced M2/HSB/Tn mice.
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Extended Data Figure 6 | Clonal dynamics in PB samples of additional 
induced mice. Data are presented in the same manner as Fig. 2. a-c, Tn tags 
from mouse A384; d-f, Tn tags from mouse AR1123. Tags unique to B or T cells 
are not shown. g-h, Tn tags from mouse AR1121. The terminal PB sample
shown in panel g encompasses approximately 50% of the blood, and the BM
samples are from forelimbs, hindlimbs, spine, sternum and ribs. k, The
percentage of recurrent Tn tags in prior PB samples when compared with that 
in the BM granulocyte sample.

Extended Data Figure 7 | Validation of results obtained in longitudinal
analyses. a, B cells and T cells Tn tags that are present in 4 or more PB samples
from induced mouse LL106. b, Results of nested-PCR analysis of PB
c, Longitudinal PB analyses of 1-day-induced mice (LL91 and LL145). d, Tn tag
numbers in PB granulocytes collected from 10–16-month-old uninduced
mice and from all time points shown for induced mice LL106, AR384 and
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Extended Data Figure 8 | Lineage relationships among BM granulocytes, 
monocytes and pro/pre-B cells. a, FACS plots showing purification scheme 
of BM granulocytes, monocytes and pro/pre-B cells. Monocytes and pro/ 
pre-B cells are double-sorted to minimize granulocytes contamination. 
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points after Dox withdrawal. c, Percentage of granulocyte Tn tags that are 
shared with pro/pre-B cells. Each column represents data from an individual 
mouse or a single bone. d, Percentages of pro/pre B cell clones and monocyte 
clones that share Tn tags with BM granulocytes. Each column represents data 
from an individual mouse or a single bone. n/a, not available. 


©2014 Macmillan Publishers Limited. All rights reserved 



Extended Data Figure 9 | Clonal analysis of haematopoiesis under 
transplantation conditions. a, Experimental flow chart showing viral 
infection of donor cells and longitudinal analysis of clonal dynamics in the 
transplant mouse. 2,000 DsRed⁺ LSK cells were transduced with retrovirus in
the presence of TPO, Flt3 and SCF for 2 days and transferred to lethally 
irradiated recipients in the presence of 110° wild-type bone marrow cells. 
b, Distribution of PB Gr tags and their presence in B cells and T cells from
recipient mouse AR1001 at three time points following transplantation.
Tn tags unique to B cells or T cells are not shown. c, Single-cell analysis
of PB granulocyte Tn tags from mouse AR1001 at 35 and 60 weeks after
transplantation. d, A subset of dominant clones revealed in single-cell analysis
(c) are stable in PB. e, Experimental flow chart showing purification and
transplantation of LT-HSCs or Lin⁻cKit⁺ BM cells from induced M2/HSB/Tn
mice. 4 X 10* DsRed* LT-HSCs or 5 X 10* DsRed ‘Lin’ cKit* cells per 
recipient mouse were used. f–h and k–m, Distribution, recurrence, and
lineage potential of PB Gr clones from recipient mouse AR856 receiving
LT-HSC donor cells (f–h) and mouse AR541 receiving Lin⁻cKit⁺ donor cells
(k–m). Data are presented in the same manner as Fig. 2b–d. i, Single-cell
analysis of granulocyte Tn tags from mouse AR856 25 weeks after 
transplantation. j, The dominant clone identified in single-cell (SC) analysis 
(clone no. 1 in i) is persistently detected in PB and BM from a single femur 
at 33 weeks. This clone is also detected in the LT-HSC compartment. 
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Structural mechanism of glutamate 
receptor activation and desensitization 


Joel R. Meyerson, Janesh Kumar, Sagar Chittori, Prashant Rao, Jason Pierson, Alberto Bartesaghi, Mark L. Mayer
& Sriram Subramaniam


Ionotropic glutamate receptors are ligand-gated ion channels that mediate excitatory synaptic transmission in the 
vertebrate brain. To gain a better understanding of how structural changes gate ion flux across the membrane, we 
trapped rat AMPA (α-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) and kainate receptor subtypes in
their major functional states and analysed the resulting structures using cryo-electron microscopy. We show that
transition to the active state involves a ‘corkscrew’ motion of the receptor assembly, driven by closure of the
ligand-binding domain. Desensitization is accompanied by disruption of the amino-terminal domain tetramer in AMPA, but not
kainate, receptors with a two-fold to four-fold symmetry transition in the ligand-binding domains in both subtypes.
The 7.6 Å structure of a desensitized kainate receptor shows how these changes accommodate channel closing. These
findings integrate previous physiological, biochemical and structural analyses of glutamate receptors and provide a
molecular explanation for key steps in receptor gating.


Ionotropic glutamate receptors (iGluRs) are major mediators of excit- 
atory synaptic transmission in the central nervous system and have a 
vital role in mediating memory and learning’. The AMPA, kainate and 
NMDA (N-methyl-D-aspartate) receptor subtypes work by opening a 
cation-selective pore in response to ligand binding, a key step in inter- 
cellular communication in the nervous system. Channel opening is fol- 
lowed by receptor desensitization that closes the channel, with both sets 
of reactions occurring over a millisecond timescale’. High-resolution 
crystallographic studies of isolated amino-terminal domain (ATD) and 
ligand-binding domain (LBD) dimers, coupled with decades of bio- 
chemical and functional studies, have provided important insights into 
the structure and function of these receptor components*”, while crys- 
tallographic analysis of the closed state of a modified form of the AMPA 
receptor (GluA2cryst) has enabled delineation of domain organization
and transmembrane structure in the context of the tetrameric receptor 
assembly in an antagonist-bound closed state’. 

Understanding the structural basis of the transition from closed to 
active and desensitized conformations is central to deciphering iGluR 
function in health and in disease. However, no structures of either active 
or desensitized conformations have been reported. Given that a number 
of earlier studies provide hints of extensive conformational variability 
in closed, active and desensitized states’, including the ability of sub- 
units to move independently during the activation process’’, it seems 
likely that trapping functionally relevant states of native receptors in 
the context of three-dimensional (3D) crystals may be challenging”’. 
Previous structural studies of a full-length kainate receptor (GluK2) at
~20 Å resolution using cryo-electron tomography and subvolume averaging
suggested that desensitization involves marked structural changes
in the LBD, with minimal changes in the ATD”’. These findings are in
contrast to the extensive quaternary rearrangements of the ATD tetramer
assembly observed in earlier single-particle negative stain analyses
on AMPA receptors at ~40 Å resolution’. However, neither of these analyses
was at resolutions high enough to provide a molecular interpretation
of the underlying domain movements involved. To determine how
glutamate receptor ion channels accommodate the structural changes
necessary for activation and desensitization, we carried out single-particle
cryo-electron microscopy (cryo-EM) of both the AMPA receptor
GluA2 and the kainate receptor GluK2. By solving structures in
multiple conformational states (summarized in Extended Data Table 1a),
we address the extent and nature of structural changes that occur in
AMPA receptors during activation and desensitization, and compare
the results with structural analysis of the GluK2 desensitized state by
single-particle cryo-EM at ~7.6 Å resolution. Our results provide a
detailed glimpse into the overall gating cycle of glutamate receptors, an 
evaluation of the similarities and differences in conformational changes 
observed in AMPA and kainate receptor families, and a molecular mech- 
anism for the marked LBD movements that occur during the receptor 
gating cycle. 


Antagonist-bound closed state GluA2 structure 


To establish the feasibility of solving iGluR structures with single-particle
cryo-EM, we first pursued structural studies of fully glycosylated GluA2
with a wild-type ATD-LBD linker (referred to as GluA2em) trapped
in the closed state with 0.3 mM ZK200775, a high-affinity competitive
antagonist’*. The 3D structure of GluA2em, determined by single-particle
cryo-EM at a resolution of ~10 Å, estimated by the gold standard 0.143
Fourier shell correlation (FSC) criterion’*, demonstrates an overall organization
similar to that reported for GluA2cryst (Fig. 1 and Extended
Data Figs 1 and 2). The two-fold symmetric dimer of dimers arrangement
of the ATD and LBD, and the domain swap across distal and proximal
subunits, are all clearly observed. In the transmembrane domain,
similar to the X-ray structure of GluA2cryst (ref. 6), density for α-helices
pre-M1, M1, M3 and M4 is less well resolved (Extended Data Fig. 2).
To obtain a molecular interpretation of the cryo-EM density map, coordinates
for two ATD dimers, two LBD dimers and the transmembrane
regions derived from GluA2cryst (Protein Data Bank (PDB) accession
3KG2) were fit as five independent rigid bodies (Fig. 1). This revealed
excellent agreement with the crystal structure, but with an increase in
separation between the ATD and LBD, which are ~8 Å further apart
in GluA2em than in GluA2cryst (Extended Data Fig. 3). We also found a
change in angle between LBD dimer pairs, from 139° in GluA2cryst to
144° in GluA2em, and an increase in separation between proximal AC
subunits of ~5 Å as measured at the top of the LBD in GluA2em. We
conclude that deletion of six residues in the ATD-LBD linker, perhaps
coupled with crystal packing forces, results in subtle conformational
changes and the creation of a buried interface in GluA2cryst that is absent
in native AMPA receptors (Fig. 1f).
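
The 0.143 FSC criterion used above to estimate resolution can be illustrated with a short NumPy sketch. It is a generic outline of the calculation, not the processing software used in this study: it assumes two independently reconstructed cubic half-maps held as arrays, and the function names are illustrative. Masking and other refinements used in practice are omitted.

```python
import numpy as np


def fourier_shell_correlation(half1, half2, voxel_size):
    """FSC curve between two cubic half-maps.

    Returns (spatial_freq, fsc), with spatial_freq in 1/Å when voxel_size
    is given in Å. The zero-frequency shell is skipped.
    """
    if half1.ndim != 3 or half1.shape != half2.shape or len(set(half1.shape)) != 1:
        raise ValueError("expected two cubic 3D maps of identical shape")
    n = half1.shape[0]
    f1 = np.fft.fftn(half1)
    f2 = np.fft.fftn(half2)

    # Integer shell index = rounded radius of each Fourier coefficient.
    k = np.fft.fftfreq(n) * n
    kx, ky, kz = np.meshgrid(k, k, k, indexing="ij")
    shell = np.rint(np.sqrt(kx**2 + ky**2 + kz**2)).astype(int).ravel()

    # Per-shell sums of the cross term and of each map's power.
    cross = np.bincount(shell, weights=(f1 * np.conj(f2)).real.ravel())
    power1 = np.bincount(shell, weights=(np.abs(f1) ** 2).ravel())
    power2 = np.bincount(shell, weights=(np.abs(f2) ** 2).ravel())

    shells = np.arange(1, n // 2)  # up to, but excluding, Nyquist
    fsc = cross[shells] / np.sqrt(power1[shells] * power2[shells])
    return shells / (n * voxel_size), fsc


def resolution_at(spatial_freq, fsc, threshold=0.143):
    """Resolution (Å) at the first shell where the FSC falls below `threshold`.

    If the curve never drops below the threshold, the last shell is reported.
    """
    below = np.nonzero(fsc < threshold)[0]
    index = below[0] if below.size else len(fsc) - 1
    return 1.0 / spatial_freq[index]
```

Calling `resolution_at(*fourier_shell_correlation(half_a, half_b, voxel_size))` on two half-maps would return the spatial scale at which their correlation first falls below 0.143, which is how a figure such as ~10 Å is read off the curve.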


Structure of GluA2 in the active state 


To determine structural changes that occur with transition to the active 
state, purified GluA2 was pre-mixed with 0.5 mM LY451646, a potent 
allosteric modulator that prevents entry into the desensitized state’®. 
After equilibration for 30 min, a saturating concentration of glutamate 
(100 mM) was added to activate ion channel gating, followed by imme- 
diate plunge freezing. Under these conditions there is very high occu- 
pancy of the open state’’, and the activation of subconductance states 
which are prominent at low agonist concentrations is reduced"*. Analysis 
of molecular images obtained from AMPA receptors in the active state 
revealed the presence of well-defined 2D class averages (Fig. 2a), allowing
reconstruction of the structure to a resolution of ~12 Å with a set of
images of similar size and quality to that used to obtain the structure of
the closed state (Extended Data Fig. 4). The slightly lower resolution
suggests that despite the presence of glutamate at a high concentration,
the active state may be more conformationally variable than the closed
state, perhaps due to the occurrence of subconductance states, or transient
excursions to a closed state. Nevertheless, two ATD dimers (PDB
3KG2) and two LBD dimers (PDB 1FTJ) were fit as rigid bodies without
ambiguity (Fig. 2b), supported by identification of secondary structure
elements (Extended Data Fig. 4). Because ATD and LBD crystal
structures provide information at atomic resolution, combining this with
the quaternary constraints provided by cryo-EM density maps allows
interpretation of structural changes in the full-length receptor at resolutions
higher than the nominal resolution of the density map. Although
the transmembrane domain is not resolved with sufficient detail to allow
interpretation of the conformation of transmembrane helices, rigid-body
fits of ATD dimers and glutamate-bound LBD dimer crystal structures
are sufficiently well constrained by the density map to allow a
molecular interpretation of the activation mechanism under conditions
where GluA2 has a high open probability (Fig. 2b).

Figure 1 | GluA2 purification, imaging and the
antagonist-bound closed state structure. a, FSEC
(fluorescence size exclusion chromatography)
profile for GluA2em showing a monodisperse
profile for tryptophan fluorescence in units of
counts per second (c.p.s.); the inset shows an
SDS-PAGE gel with molecular weight (MW)
markers in kDa for pooled fractions following
IMAC (immobilized metal affinity
chromatography) purification, after thrombin
cleavage to remove the GFP fusion protein, and
following preparative SEC. b, c, Representative
power spectrum (solid line) overlaid with the
computed contrast transfer function (dashed line)
(b) for a cryo-EM image (c), with insets
highlighting images of individual GluA2–ZK200775
complexes (scale bar, 100 nm). AU, arbitrary units.
d, Representative 2D class averages from the
initial classification of 40,709 projection images.
e, Isosurface representation of the GluA2 closed
state cryo-EM structure at ~10 Å resolution
segmented to show distal AC (green and blue) and
proximal BD (red and yellow) subunits with
GluA2cryst (PDB 3KG2) coordinates for the ATD,
LBD and transmembrane regions fit separately as
rigid bodies; the dashed lines highlight putative
membrane boundaries. f, Illustration of the region
of the LBD layer that is in close contact with the
ATD in GluA2cryst (top panel, cyan shading)
that is not observed for GluA2em (LBD layer of
experimental cryo-EM density map, and
corresponding fits, are shown in the middle and
bottom panels, respectively).

Comparison of the closed and active state density maps reveals LBD
‘clamshell’ closure (Fig. 2c), as seen for isolated LBD dimers"’. Furthermore,
a ~7 Å vertical contraction of the ATD-LBD assembly is observed,
measured as a downwards movement at the top of the ATD tetramer,
as well as unanticipated movements in the LBD, in which the dimer
pairs rotate about an axis offset from the local axis of two-fold symmetry
(Fig. 2d). The coordinate fits show unambiguous evidence for differential
vertical displacement of the proximal A and C subunits, the upper
lobes of which move down by ~10 Å, as compared to a ~4 Å movement
of the distal B and D subunits (Fig. 2d). When viewed perpendicular to
and into the plane of the membrane, the upper surface of the LBD tet- 
ramer assembly rotates anticlockwise, and the lower lobes separate (Fig. 2e). 
The net result of these movements is a novel corkscrew-like rotation 
that drives the transition from the closed to the active conformation (Sup- 
plementary Video 1). In contrast to previously reported models for the 
active state that predicted a substantial decrease in separation of the 
proximal subunits at the top of the LBD tetramer assembly”, we find 
instead a small increase in proximal subunit separation that accompan- 
ies the corkscrew motion, and a 30° increase rather than a decrease’®”® 
or no change’ in the angle between dimer pairs (Extended Data Table 1b 
and Supplementary Video 3). 

Figure 2 | Structural changes accompanying opening of GluA2.
a, Representative 2D class averages of GluA2em in the active state after
initial classification of 31,637 projection images. b, GluA2em active
state structure shown in isosurface representation, fitted with ATD
dimers (PDB 3KG2) and glutamate-bound LBD dimers (PDB 1FTJ) with
the transmembrane region covered by micellar density. c, Density maps
for a single subunit showing the visible difference between the
antagonist-bound open cleft conformation (top) and the
glutamate-bound closed cleft conformation (bottom) of the LBD
‘clamshell’; the right-hand panel shows the corresponding coordinate
fits. d, Ribbon and cylinder diagrams for GluA2 coordinates fit to the
closed (magenta) and active (blue) states reveal a ~7 Å downward
displacement of the ATD in the active state (top), with proximal and
distal subunit LBD dimer assemblies viewed parallel to (middle) and
perpendicular to (bottom) the membrane. Black dashed lines show
the approximate planar interface between subunits in the dimer
assembly. e, Isosurface views of LBD tetramer region density maps fit with
LBD dimers in closed (left) and active (right) states. Coloured dots
identify the locations of Cα atoms for Val 395 (upper lobe) and Ala 665
(lower lobe). f, Movement of the S1-M1 linker (Lys 505), M3-S2 linker
(Glu 634) and S2-M4 linker (Gly 771) shows how LBD tetramer
movements drive channel opening; arrows show the direction
of movement from closed to active states.

The off-axis LBD dimer assembly movements described above trigger
asymmetric rearrangements of the LBD-transmembrane linkers that were also
not revealed in models generated previously either by replacement of
LBD dimers in GluA2cryst with glutamate-bound LBD dimer assemblies’,
or by generating tetramer assemblies using crystallographic symmetry
operations for LBD dimer structures'*”’. The largest conformational
change occurs at the end of the M3-S2 linker, for which there is a 33°
anticlockwise rotation of the proximal AC subunits, coupled with a
clockwise rotation by 20° of the distal BD subunits viewed perpendicular
to the plane of the membrane. In addition to the large increase in
separation of the M3-S2 linkers within each dimer pair, as first predicted
by crystallographic studies on isolated iGluR LBD dimer assemblies",
we also found substantial vertical movements of the M3-S2 and S2-M4,
but not S1-M1, linkers, accompanied by a decrease in separation
of the AC and BD subunit dimer assemblies for all three sets of linkers
(Fig. 2f). For the S1-M1 linker, the dimensions of the twisted parallelepiped
connecting the four subunits changed from 30 × 38 Å in the
closed state to 40 × 34 Å in the active state, with minimal vertical movement.
For the M3-S2 linker, there is a ~6 Å downwards movement of
the distal but not proximal subunits, with a change in parallelepiped
dimensions from 14 × 40 Å in the closed state to 29 × 36 Å in the active
state (Fig. 2f). The S2-M4 linker also undergoes 8 Å and 3 Å downward
translations for the proximal and distal subunits, respectively, accompanied by a
change in parallelepiped dimensions from 40 × 47 Å in the closed state
to 38 × 41 Å in the active state. By contrast, in previous models for AMPA
receptor active states generated using crystallographic symmetry operations
for isolated LBDs’°”, vectors connecting the LBD-ion channel
linkers are arranged as planar arrays, without the characteristic
twist observed in the active state EM structure and in the open state
model based on GluA2cryst (refs 6, 12) (see detailed comparison in Extended
Data Table 1b and Supplementary Video 3). Overall, analysis
of the GluA2em map reinforces the idea that glutamate-triggered movement
of the M3-S2 linker unwinds the M3 helix bundle, suggesting
how the overall quaternary structural changes enable opening of the
ion channel.


GluA2 desensitization mechanism 

AMPA and kainate receptors exhibit rapid and nearly complete
desensitization of ion flux within milliseconds after glutamate binding. To
trap GluA2em in the desensitized state, we incubated the purified
protein with 1 mM quisqualate, a full agonist with a dissociation constant
(Kd) of ~20 nM and tenfold higher affinity than glutamate. Analysis
of cryo-electron microscopic images revealed evidence of substantial 
conformational heterogeneity (Fig. 3a) precluding determination of a 
single desensitized state 3D structure. Three-dimensional classification 
enabled separation of three dominant classes at nominal resolutions of 
21 Å, 23 Å and 26 Å with variable degrees of displacement between ATD
dimers compared to the closed and active states (Fig. 3b and Extended 
Data Fig. 5). In all three classes, the LBD layer separates into four lobes 
of density, with different degrees of separation between the proximal 
and distal LBD subunits, strikingly different from the dimer-of-dimers
structure found in the closed and active states. This variability in ATD
and LBD conformation is further illustrated in top views that capture
the extent of the quaternary structural change in the three desensitized
states as compared to the active state (Fig. 3c). It is likely that the three
desensitized states are subsets of an even larger spectrum that includes
additional weakly populated conformational variants. Nevertheless, our
findings establish that desensitization results in separation of the LBD
dimers into a quasi-four-fold arrangement, coupled with conformational
heterogeneity in the ATD layer not observed for either the closed or
active states.

Figure 3 | Conformational ensemble of desensitized GluA2.
a, Representative desensitized state GluA2em 2D class averages from initial
classification of 35,083 projection images. Selected class averages that
illustrate the range of observed conformations are highlighted. b, Segmented
isosurface representations of three distinct desensitized state GluA2em
structures, with the ATD and LBD layers identified in blue and orange,
respectively. c, Top views of ATD and LBD layers for the three GluA2em
desensitized states (middle columns) flanked by those from the active state
(left) and restored active state (right).
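
As a rough check on why 1 mM quisqualate can be treated as saturating, the
fractional occupancy of a binding site can be estimated from the dissociation
constant of ~20 nM quoted above. The sketch below assumes a simple one-site
binding isotherm, independent and equivalent sites, and free ligand in large
excess over protein; these assumptions and the function name are illustrative
and are not taken from the paper.

```python
def fractional_occupancy(ligand_molar: float, kd_molar: float) -> float:
    """One-site binding isotherm: bound fraction = [L] / ([L] + Kd)."""
    if ligand_molar < 0 or kd_molar <= 0:
        raise ValueError("concentration must be >= 0 and Kd must be > 0")
    return ligand_molar / (ligand_molar + kd_molar)


QUISQUALATE_M = 1e-3  # 1 mM quisqualate, as used to trap the desensitized state
KD_M = 20e-9          # ~20 nM dissociation constant quoted in the text

per_site = fractional_occupancy(QUISQUALATE_M, KD_M)
# Assumption: the four sites of the tetramer bind independently.
all_four_sites = per_site ** 4

print(f"per-site occupancy: {per_site:.5f}")
print(f"all four sites occupied: {all_four_sites:.5f}")
```

Under these assumptions each site is more than 99.99% occupied, which is
consistent with treating the sample as fully liganded.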

To determine whether conformational variability observed for 
quisqualate-GluA2em complexes reflects the properties of functional
receptors, we tested whether subsequent addition of 0.5 mM LY451646
to the same preparation used for cryo-EM analysis of the desensitized
state would restore a homogeneous active state conformation. Structural
analysis at ~16 Å resolution of GluA2em receptor suspensions treated
this way demonstrates that the active conformation is indeed restored 
(Fig. 3c and Extended Data Fig. 6). Our experiments thus establish that 
in the desensitized state, quisqualate-bound GluA2 is fully functional 
and capable of undergoing conversion to the active state. 


GluK2 desensitized state at subnanometre resolution 


The GluK2 desensitized state map we reported previously, using
cryo-electron tomography and subvolume averaging, was not at sufficiently
high resolution to delineate the structural changes that underlie desen- 
sitization (Extended Data Fig. 9). Furthermore, the analysis presented 
above indicates that obtaining a high-resolution structure of the desen- 
sitized state of GluA2 is likely to be technically challenging owing to the 
intrinsic conformational mobility of the ATD and LBD. GluK2, how- 
ever, seemed like a more promising candidate given that subvolume 
classification of the tomographic data suggested a high degree of
conformational homogeneity, consistent with the 100-fold greater
stability of the GluK2 desensitized state revealed by electrophysiological
analysis. We therefore carried out single-particle cryo-EM analysis
of GluK2 stabilized in the desensitized state by incubation with
2S,4R-4-methylglutamate, a full agonist that binds with 100-fold higher
affinity than glutamate.

In the structure of the GluK2 desensitized state, determined at ~7.6 Å
resolution (Fig. 4a and Extended Data Figs 7 and 8), density was resolved
for all α-helices in the ATD and LBD assemblies, and also for the M3
helix bundle, the upper segment of M1 and the pre-M1 cuff helix in the 
ion channel (Fig. 4b). The density map reveals preservation of two-fold 
symmetry in the ATD layer while the LBD layer adopts a quasi-four- 
fold symmetric arrangement (Fig. 4c). To obtain a molecular model for 
the desensitized state, we fitted two copies of GluK2 ATD dimer assem- 
blies (PDB 3H6G) and four copies of a GluK2 subunit LBD glutamate 
complex (PDB 3G3F). The resolution of our map is adequate to show 
unambiguously that in the desensitized state the ion channel adopts a 
closed conformation (Fig. 4b, panel 7) in which the M3 helices form a 
crossed bundle assembly with the pre-M1 helices wrapped around the 
outside of the channel, similar to that seen for GluA2cryst in its
antagonist-bound closed state.

To describe the nature and extent of the conformational changes 
that occur in the transition from the active to desensitized conforma- 
tions, we constructed a homology model for the GluK2 active state based 
on the cryo-EM structure of the GluA2 active state. Comparison of sub- 
unit orientations in ligand binding domain dimer assemblies for the 
GluK2 active state (Fig. 4d left) with those in the desensitized state (Fig. 4d
right) reveals that the distal subunits swing clockwise by ~125° in the
horizontal plane, while the proximal subunits rotate by only ~13°. In
the vertical plane the distal and proximal subunits tilt 11° and 6° away,
respectively, from the global axis of symmetry. As a result, in the GluK2
desensitized state the LBD layer resembles an inverted pyramid in which 
the four subunits are arranged with quasi-four-fold symmetry. The ATD- 
LBD linkers which mediate the two-fold to four-fold symmetry transition 
easily support these large movements, with less than a 1 Å increase in
distance between the Cα atoms of Met 382 and Leu 402 which connect
the ATD and LBD for the A and C subunits, and a 5-6 Å decrease in
separation for the B and D subunits. Rupture of LBD dimer assemblies,
in which the distance between the Lys 500 Cα atoms at the location of a
conserved intermolecular salt bridge in the upper lobe increases by 40 Å
in the desensitized state, is in good agreement with both surface
accessibility measurements in GluA2 (ref. 24) and the effect on the kinetics
of desensitization of mutations that alter the Kd for the GluK2 LBD dimer
assembly.

Visualization of the probable trajectory of the distal subunit along
the arc that it travels during the transition from the active to
desensitized states (Supplementary Video 2) suggests an explanation both for
the previously reported crystal structures of crosslinked GluA2 dimers,
and for the heterogeneous conformations observed for the GluA2
desensitized state (Fig. 3). Superposition of a proximal subunit of the GluK2
desensitized state tetramer with the proximal subunit from each of the
three GluA2 disulphide crosslinked dimer assemblies (previously
proposed to represent the desensitized conformation) suggests that these
crystallized conformations are structural intermediates where further
movement is prevented by disulphide crosslinks. Taken together, these
observations reinforce the idea that AMPA and kainate receptors
desensitize by a conserved mechanism. Notably, owing to the substantial
reorganization of the LBD, residues in α-helix G, but from different subunit
combinations, are in close proximity in both closed and desensitized
states, providing an explanation for recent biochemical crosslinking
studies on GluA2 that failed to detect state-dependent trapping at
these positions.

Figure 4 | GluK2 receptor desensitization. a, GluK2 desensitized state
density map at ~7.6 Å resolution, segmented and coloured to show four
receptor subunits, fit with coordinates for GluK2 ATD dimers (PDB 3H6G)
and glutamate-bound GluK2 LBD monomers (PDB 3G3F). The GluA2cryst
transmembrane domain was fit as a rigid body. Portions of transmembrane
helices where density was only weakly resolved are shown in white. b, Close-up
views of selected regions of the density map labelled in a. c, Top views of density
maps for ATD (upper panel) and LBD (lower panel) layers. Coloured dots
connected by dashed lines identify the locations of His 3 and Met 382 at the top
and base of the ATD, with the progressively smaller parallelograms for Leu 402,
Lys 500 and Ser 669 indicating the top, middle and base of the LBD.
d, Structural changes in an LBD dimer assembly underlying the transition from
the active (blue) to desensitized (magenta) states presented as side (upper
panel) and top (lower panel) views. α-Helix J, highlighted as a transparent
cylinder, and loop 1, marked by an asterisk, illustrate the magnitude of LBD
rotation with desensitization. Dashed lines show the approximate location
of the planar interface between subunits in the domain dimer. e, Movement of
the S1-M1 linker (Arg 512), M3-S2 linker (Asp 638) and S2-M4 linker
(Gly 772) indicates how LBD tetramer movements drive channel closure;
arrows show the direction of movement from active to desensitized states.

This raises the question of how this rearrangement of the LBD af- 
fects the ion channel. In both the desensitized state and the antagon- 
ist-bound state the ion channel has a closed conformation. However, 
in the desensitized state the linkers connecting the LBD to the channel 
adopt approximately four-fold symmetric arrangements, with changes in
dimensions of the parallelograms formed by the S1-M1, M3-S2 and S2-M4
linkers from 41 × 25 Å, 43 × 15 Å and 46 × 36 Å in the active state, to nearly
symmetric values of 34 × 37 Å, 24 × 26 Å and 60 × 56 Å, respectively,
in the desensitized state (Fig. 4e). Most notably, the origin of rotation
for the four subunits in the transition from the active to desensitized
state is located close to the S1-M1 linker, such that the M3-S2 and S2-M4
linkers rotate clockwise around S1-M1, with a much larger radial
sweep for the distal versus proximal subunits. In the vertical plane, the
three transmembrane linkers for the distal subunits move upwards by
5, 10 and 7 Å (Fig. 4e), relieving the downwards movements that occur
in the transition from the closed to the active state (Fig. 2f). However, in
the desensitized state the linkers adopt a planar arrangement, strikingly
different from the closed state in which the transmembrane linkers form
twisted parallelepipeds. Despite these large movements, measurement
of the change in distance between the Cα atoms of Arg 512 and Val 521,
Val 630 and Asp 638, and Gly 772 and Val 786 reveals that in the
desensitized state the S1-M1, M3-S2 and S2-M4 linkers increase in length
by only 4-5 Å for the proximal A/C subunits, while for the distal B/D
subunits the M3-S2 and S2-M4 linker length decreases by 3 Å.


Molecular mechanism of receptor gating 


AMPA and kainate receptors are widely regarded as functionally and 
structurally related families. Our study exploited the availability of unique 
AMPA receptor allosteric modulators to trap GluA2 in the active state, 
while the greater thermodynamic stability of the GluK2 desensitized 
state yielded a higher resolution structure than could be achieved for 
GluA2em. On the basis of the similarity observed in the LBD layer in the
desensitized states of both GluA2em and GluK2, we conclude that
transition to quasi-four-fold symmetry in the LBD layer is a key structural
signature of desensitization in these iGluR families. On the basis of these 
similarities, we propose a unified description of the structural changes 
that occur during the gating cycle of receptors in the iGluR family (Fig. 5). 
Several lines of evidence justify this approach: after genetic removal of the 
ATD, both GluA2 and GluK2 display gating properties that are similar
to those of the parent receptors, with activation and desensitization
on the millisecond timescale, indicating that the conformational change 
underlying activation and desensitization occurs primarily in the LBD 
and ion channel assemblies, independent of conformational variabil- 
ity in the ATD. Consistent with this observation, numerous crystal- 
lographic studies of soluble GluA2 and GluK2 LBDs have established 
essentially identical sets of structures and extents of movement for
complexes with agonists and antagonists. In the ion channel domain, GluA2
and GluK2 share 73% amino acid sequence similarity, and exhibit
common functional properties including prominent subconductance states,
similar relative permeability to sodium and calcium ions, and
channel block by cytoplasmic polyamines.

Binding of glutamate triggers both clamshell closure and a rotation 
of LBD dimer assemblies; this necessitates compensatory movement 
elsewhere in the protein. As a result, the upper lobes of the LBD pull 
‘down’ the ATD layer, while the lower lobes of the LBD exert both lat- 
eral and upward forces to open the channel. Strain in the open state is 
released by transition to a desensitized state in which the channel closes 
with the LBD remaining in the closed-cleft, glutamate-bound confor- 
mation. We answer the question of how the receptor, in its desensitized 
state, simultaneously accommodates both a closed cleft, which intro- 
duces molecular tension, and a closed channel, which requires release 
of tension, by showing that in the desensitized state the LBD layer under- 
goes a marked rearrangement featuring a two-fold to four-fold symmetry 
change. The resulting four-fold symmetry in the LBD layer matches that 
of the ion channel in its non-conducting state, and thereby permits the 
channel to adopt a low-energy conformation. In this way, ligand disso- 
ciation kinetics are decoupled from channel activation and deactivation. 


[Figure 5 panel labels: Resting, Open, Desensitized]


Figure 5 | Unified view of glutamate receptor gating cycle. a, Schematic 
summary of global conformational changes highlighting domain movements 
with channel opening and closure during the receptor gating cycle. The dashed 
lines over the open state indicate the shortening as a result of the corkscrew 
rotation that opens the channel. The differences observed between desensitized 
states of GluK2 and GluA2 are illustrated as variations of a common theme in
which the LBD layer shifts to four-fold symmetry with or without separation of
the ATD dimer pairs. 




It is notable that while both AMPA and kainate receptors adopt four- 
fold symmetry in their desensitized LBD layers, AMPA receptor desen- 
sitization also causes a disruption of the ATD layer. This result can be 
understood by considering the symmetry mismatch within the recep- 
tor, and changes in symmetry during the gating cycle. In the closed and 
open states, both the ATD and LBD layers have two-fold symmetry. 
The strain resulting from agonist binding to the LBD is centred near the 
LBD-transmembrane interface and is sufficient to open the channel. 
In the desensitization step, the LBD layer shifts from two-fold to four- 
fold symmetry, matching the four-fold symmetry of the ion channel; 
the strain in the receptor now shifts to the two-fold symmetric ATD. 
In GluK2, the ATD assembly appears to be able to withstand this strain, 
possibly relieving it by the drawbridge-like tilting at the ATD tetramer 
interface. However, in GluA2, this symmetry mismatch places sufficient 
strain on the ATD layer to disrupt the tetramer interface. This hypoth- 
esis is supported by measurements of subunit interactions by analytical 
centrifugation for isolated ATDs that reveal much weaker interactions 
of GluA2 ATD dimers than GluK2 ATD dimers.

A central value of single-particle cryo-EM methods used here is that 
they enable definition of functionally important large-scale receptor struc- 
tural changes without the constraints introduced by disulphide cross- 
links or crystal lattice contacts, and lay the foundation for screening 
potential receptor-drug interactions. At the same time, the need to use
detergents or amphipols to stabilize membrane proteins for structural
analysis has the potential to disrupt functionally important protein-lipid
interactions. Whether this has an impact on ligand-gated ion
channel structures is an important area for future research. Other chal- 
lenging problems include obtaining a structural understanding of how 
glutamate receptor activation, at lower agonist concentrations than 
used in the present study, leads to subconductance states. In addition,
it will be important to explore central mechanistic questions such as how
individual LBDs move independently during the activation process,
how gating occurs in heteromeric iGluR assemblies such as NMDA
receptors, and whether the extent of cleft closure, which varies for partial
agonists, has any consequence on either open state or desensitized state
structures.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Protein expression, purification and conformational trapping. The rat GluA2flip
subunit cDNA sequence (P19491-2) including the native signal peptide was cloned 
into the pFastBac1 vector for baculovirus expression in Sf9 insect cells. The GluA2em
construct contained the R586Q and C589A point mutations and was truncated at
K826 to improve yield and tetramer stability. GluA2em differs from GluA2cryst by
restoration to the wild-type sequence of six residues in the ATD-LBD linker, four
N-linked glycosylation sites, and four residues in loop 1 of the LBD. Likewise, the
full-length rat GluK2 subunit cDNA sequence (P42260) including the native signal
peptide and the carboxy-terminal domain was cloned into the pFastBac1 vector; the
construct was RNA edited at position 536 (I to V) and had two mutations (C545V
(M1) and C564S (M1-M2 loop)) which increased yield and tetramer stability. In
the LBD, four mutations (A47T, A658S, N690S and F704L), which convert the
sequence to that found in GluK1, were introduced to create a high-affinity binding
site for the GluK1 selective antagonist LY466195. For fluorescence detection and
affinity purification, a thrombin recognition site and linker sequence
(GLVPRGSAAAA) was inserted between the GluA2 or GluK2 sequence and the coding
sequence for the A207K dimerization-suppressed EGFP mutant, with a C-terminal
SGLRHis8 affinity tag. Sf9 cells (12 l) were harvested 72 h after infection,
collected by low-speed centrifugation, and frozen at −80 °C. Cell pellets were
re-suspended in ice-cold buffer (18-20 ml l⁻¹) containing 150 mM NaCl, 50 mM
Tris, pH 8.0, 0.8 µM aprotinin, 2 µg ml⁻¹ leupeptin, 2 µM pepstatin and 1 mM
PMSF, and then disrupted on ice using a QSonica Q700 sonicator (18 × 15 s,
power level 7). The lysates were clarified by low-speed centrifugation, and
membranes collected by ultracentrifugation (Ti45 rotor, 40,000 r.p.m., 45 min),
followed by mechanical homogenization, and solubilization for 1 h at 4 °C in
buffer containing 150 mM NaCl, 20 mM Tris pH 8.0, 50 mM
n-dodecyl-β-D-maltopyranoside (DDM) and 8.5 mM cholesterol hemisuccinate (CHS)
for GluA2, and DDM alone for GluK2. Insoluble material
was removed by centrifugation (Ti45 rotor, 40,000 r.p.m., 45 min) and
cobalt-charged TALON metal affinity resin (20 ml) was added to the supernatant
together with 10 mM imidazole. After binding for 90 min at 4 °C the resin was
packed in a column, washed with buffer containing 150 mM NaCl, 20 mM Tris pH 8.0,
0.75 mM DDM, 40 mM imidazole (with 0.12 mM CHS added for GluA2) until the OD at
280 nm reached a stable low value, and then eluted with an increase to 250 mM
imidazole. Peak fractions were digested overnight at 4 °C with thrombin at a
1:100 w/w ratio. GluA2 and GluK2 tetramers isolated by gel filtration
chromatography (Superose 6 10/300) in a buffer containing 150 mM NaCl, 20 mM
Tris pH 8.0, 0.75 mM DDM, and for GluA2 0.12 mM CHS, were concentrated to
2 mg ml⁻¹ (100 kDa MWCO), and then stored on ice to trap the desired
conformational state. For GluA2, 0.3 mM ZK200775
([[3,4-dihydro-7-(4-morpholinyl)-2,3-dioxo-6-(trifluoromethyl)-1(2H)-quinoxalinyl]methyl]phosphonic acid)
was added to stabilize the closed state; to trap the open state, 0.5 mM LY451646
(N-[(2R)-2-(4-cyano[1,1’-biphenyl]-4-yl)propyl]-2-propanesulfonamide)
was allowed to bind for 30 min before addition of
100 mM glutamate, with 15 s elapsing before sample vitrification; to trap the
desensitized state 1 mM quisqualate was allowed to bind for 20 min before
vitrification; to reverse desensitization, 0.5 mM LY451646 was added to the
quisqualate-bound protein and allowed to equilibrate for 30 min before
vitrification; for GluK2, the desensitized state was trapped using 1 mM
2S,4R-4-methylglutamate, as described previously. Attempts to use the
allosteric modulator concanavalin A to
trap a GluK2 open state were unsuccessful due to aggregation of the receptor-lectin 
complex. 

Specimen vitrification and cryo-electron microscopy. Vitrified specimens were
prepared by adding 2.5 µl of liganded GluA2 or GluK2 at 1.8 mg ml⁻¹ to R2/2
holey carbon grids (Quantifoil, Jena, Germany) rendered hydrophilic by chemical
treatment to enable particle distribution into the holes (J.R.M., P.R., J.K., S.C., J.P.,
M.L.M. and S.S., manuscript in preparation). Grids were blotted for 2 s, then
plunge-frozen in liquid ethane using an FEI Vitrobot Mk IV (FEI Company,
Hillsboro, OR), with the chamber maintained at 22 °C and 100% humidity. Following
vitrification, grids were post-mounted into autoloader cartridges and transferred
to the microscope. Cryo-EM imaging was done on an FEI Titan Krios microscope
(FEI Company, Hillsboro, OR) operated at 300 kV, aligned for parallel
illumination, and equipped with a high-brightness XFEG and Cs corrector.
Projection images were acquired as seven-frame movies with a 4,096 × 4,096
back-thinned Falcon II CMOS detector at a nominal magnification of 47,000,
corresponding to a pixel size of 1.406 Å at the specimen plane. Imaging was
carried out using FEI EPU automated data-acquisition software to collect
approximately equal numbers of images at nominal focus values of −2.0, −2.5,
−3.0 and −3.5 µm, and with a dose rate and exposure time of 20 e Å⁻² s⁻¹ and
3.5 s, respectively.
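
The acquisition parameters above fix the total exposure, the sampling limit and
the imaged area. The following short calculation works these out; it assumes
that the dose is spread evenly over the seven frames, which is not stated in
the text, and the variable names are illustrative only.

```python
DOSE_RATE = 20.0      # electrons per Å^2 per second
EXPOSURE_TIME = 3.5   # seconds
N_FRAMES = 7          # frames per movie
PIXEL_SIZE = 1.406    # Å per pixel at the specimen plane
DETECTOR_PIXELS = 4096

total_dose = DOSE_RATE * EXPOSURE_TIME                   # electrons per Å^2
dose_per_frame = total_dose / N_FRAMES                   # assumes equal frames
nyquist_limit = 2.0 * PIXEL_SIZE                         # Å, finest resolvable spacing
field_of_view_nm = DETECTOR_PIXELS * PIXEL_SIZE / 10.0   # Å converted to nm

print(f"total dose: {total_dose:.1f} e/Å^2")
print(f"dose per frame: {dose_per_frame:.1f} e/Å^2")
print(f"Nyquist limit: {nyquist_limit:.3f} Å")
print(f"field of view: {field_of_view_nm:.1f} nm per side")
```

With these values the total exposure is 70 e Å⁻², or 10 e Å⁻² per frame if the
frames are equal, and the Nyquist limit of about 2.8 Å is well beyond the
resolutions reported for the maps.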

Image processing. Unbinned image stacks were corrected for drift and
beam-induced motion by alignment using cross-correlation as implemented in IMOD
tiltxcorr. Particles were manually identified and selected using the program e2boxer
in the EMAN2 program suite. Integrated and unintegrated versions of aligned
multi-frame images were processed in the framework of Relion (version 1.2). The
integrated images were used for CTF estimation with CTFFIND3 as implemented
in the Relion workflow. Extracted particles were normalized, and subjected to
25 rounds of both iterative 2D classification (regularization parameter T = 2) and 3D
classification (T = 4) with C2 symmetry imposed. Uninterpretable, low-population,
or poorly defined classes were discarded at both stages. Single particles were then
processed using the Relion auto-refine routine until convergence, at which point
frames corresponding to a combined dose of ~25 e Å⁻² were substituted for the
integrated frames, and used for final refinement. Density maps were B-factor
corrected in Relion and 'gold-standard' FSC resolution plots were calculated using
the EMAN2 program e2proc3d with a soft shape mask applied to independent
unfiltered half maps from Relion. To visualize variation in resolution across the
maps, the blocres utility was used to calculate local resolution maps and colour
the density maps accordingly. The desensitized GluA2 maps were not visualized
in this way as their low resolutions limit interpretation to qualitative terms.
Three-dimensional image processing for all conformational states was bootstrapped
using a 60 Å resolution map of the GluK2 receptor determined by cryo-electron
tomography and subvolume averaging as an initial model. Extended Data Table 1a
contains the number of micrographs and the number of particles used at all stages
of image processing.
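
To make the gold-standard FSC criterion concrete, the sketch below computes a
Fourier shell correlation between two half maps with NumPy and reports the
resolution at the 0.143 threshold. It is a minimal illustration, not the Relion
or EMAN2 implementation: it applies no mask, takes the first shell below the
threshold without interpolation, and uses a synthetic Gaussian blob plus noise
in place of real half maps. All function names are hypothetical.

```python
import numpy as np


def fourier_shell_correlation(half1: np.ndarray, half2: np.ndarray) -> np.ndarray:
    """FSC per integer-radius shell (0 .. n/2 - 1) for two equal cubic maps."""
    if half1.shape != half2.shape or half1.ndim != 3 or len(set(half1.shape)) != 1:
        raise ValueError("expected two cubic 3D maps of identical shape")
    n = half1.shape[0]
    f1 = np.fft.fftn(half1)
    f2 = np.fft.fftn(half2)

    # Integer frequency index along each axis, in the order used by fftn.
    k = np.fft.fftfreq(n) * n
    kx, ky, kz = np.meshgrid(k, k, k, indexing="ij")
    shell = np.rint(np.sqrt(kx**2 + ky**2 + kz**2)).astype(int).ravel()

    # Sum the cross term and both power terms over each shell.
    cross = np.bincount(shell, weights=(f1 * np.conj(f2)).real.ravel())
    power1 = np.bincount(shell, weights=(np.abs(f1) ** 2).ravel())
    power2 = np.bincount(shell, weights=(np.abs(f2) ** 2).ravel())

    n_shells = n // 2
    denom = np.sqrt(power1[:n_shells] * power2[:n_shells])
    return np.divide(cross[:n_shells], denom, out=np.zeros(n_shells), where=denom > 0)


def resolution_at_threshold(fsc, box_size, pixel_size, threshold=0.143):
    """Resolution (Å) of the first shell where the FSC falls below the threshold."""
    below = np.nonzero(np.asarray(fsc)[1:] < threshold)[0]
    if below.size == 0:
        return 2.0 * pixel_size  # never crosses: limited by the Nyquist frequency
    shell_index = below[0] + 1   # +1 because shell 0 (the map mean) is skipped
    return box_size * pixel_size / shell_index


# Synthetic example: the same smooth blob with independent noise in each half.
rng = np.random.default_rng(0)
n, pixel_size = 32, 1.406
axis = np.arange(n) - n / 2
x, y, z = np.meshgrid(axis, axis, axis, indexing="ij")
blob = np.exp(-(x**2 + y**2 + z**2) / (2 * 3.0**2))
half_a = blob + 0.1 * rng.normal(size=blob.shape)
half_b = blob + 0.1 * rng.normal(size=blob.shape)

fsc = fourier_shell_correlation(half_a, half_b)
print("resolution at FSC = 0.143 (Å):", resolution_at_threshold(fsc, n, pixel_size))
```

The same function can compare an experimental map with a map computed from
fitted coordinates, in which case a threshold of 0.5 would be passed instead,
as in the Extended Data figures.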

Structural analysis. Fitting of coordinates into density maps, segmentation and
visualization were all carried out using UCSF Chimera. Measurement of
inter-subunit distances was done using Cα atoms at reference points. For GluA2
(and GluK2), Ser 5 (His 3) in β-strand 1 locates the top of the ATD; Ile 203 (Leu 212)
at the base of helix 7 measures the distance between the proximal B and D subunits
in the ATD dimer of dimers interface; Leu 378 (Met 382) in β-strand 15 defines the
base of the ATD, preceding the ATD-LBD linker. After the ATD-LBD linker, Val 395
(Leu 402) in β-strand 1 locates the start of the LBD; Lys 493 (Lys 500) in the loop
between β-strands 6 and 7 defines the intermolecular salt bridge that links the upper
lobes of dimer pairs formed by the AD and BC subunits; Ala 665 (Ser 669) defines
the start of helix G in the lower lobe. To locate the start of the LBD-transmembrane
linkers we followed the selections used for GluA2cryst. Thus, we used Lys 505
(Arg 512) in β-strand 7 for S1-M1; Glu 634 (Asp 638) at the C terminus of α-helix E
for M3-S2; and Gly 771 (Gly 772) adjacent to the LBD conserved disulphide bond
for S2-M4. We chose these positions cognizant of the fact that ATD and LBD
crystal structures we fit to EM maps were engineered, replacing the ion channel
and LBD-transmembrane linkers in the LBD with a GT dipeptide. We established
by superposition of GluA2 ATD, LBD agonist and LBD antagonist complex crystal
structures on GluA2cryst that these reference positions were not perturbed in the
soluble ATD and LBD crystal structures; then, by superposition of GluK2 ATD and
LBD crystal structures we identified structurally equivalent positions in GluK2.
Structural analysis was done on domain coordinates that were rigid body fit into
cryo-EM density maps. The reproducibility of rigid body fits was established by
using a coordinate orientation randomization scheme. First, domain rotation was
randomized between ±10° on all three axes, and the translation randomized by
±10 pixels (14.1 Å) on all three axes. The coordinates were then re-fitted, and then
saved as a new PDB file. This was repeated five times and the r.m.s.d. value
between the structures was found to be well below 1 Å. This indicated that map quality
was high enough to permit the Chimera fitting routine to converge on the same
minima reproducibly. The final maps for GluA2 in the open state and GluK2 in the
desensitized state were validated using the tilt-pair parameter plot using pairs of
images at zero tilt (first exposure) and 10° tilt (second exposure). Using the final
refined versions of the respective density maps as the reference 3D model,
orientations of manually selected particles were assigned using FREALIGN and plotted
using the TILTMULTIDIFF program.
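
The randomization scheme used to test the reproducibility of the rigid body
fits can be sketched as follows. Coordinates are perturbed by a random rotation
of up to ±10° about each axis and a random shift of up to ±10 pixels, re-fitted,
and the pairwise r.m.s.d. between five re-fitted copies is reported. Because no
density map is available here, a least-squares (Kabsch) superposition onto the
starting coordinates stands in for the Chimera fit-in-map step; that
substitution, the random test coordinates and all function names are
assumptions made for illustration.

```python
import numpy as np

PIXEL_SIZE = 1.406  # Å per pixel, from the imaging conditions


def rotation_matrix(ax: float, ay: float, az: float) -> np.ndarray:
    """Rotation about x, then y, then z (angles in radians)."""
    cx, sx = np.cos(ax), np.sin(ax)
    cy, sy = np.cos(ay), np.sin(ay)
    cz, sz = np.cos(az), np.sin(az)
    rx = np.array([[1, 0, 0], [0, cx, -sx], [0, sx, cx]])
    ry = np.array([[cy, 0, sy], [0, 1, 0], [-sy, 0, cy]])
    rz = np.array([[cz, -sz, 0], [sz, cz, 0], [0, 0, 1]])
    return rz @ ry @ rx


def random_rigid_perturbation(coords, rng, max_deg=10.0, max_shift_px=10.0):
    """Rotate about the centroid and translate by random amounts within limits."""
    angles = np.radians(rng.uniform(-max_deg, max_deg, size=3))
    shift = rng.uniform(-max_shift_px, max_shift_px, size=3) * PIXEL_SIZE
    centroid = coords.mean(axis=0)
    return (coords - centroid) @ rotation_matrix(*angles).T + centroid + shift


def kabsch_fit(mobile, target):
    """Least-squares superposition of mobile onto target (stand-in for a map fit)."""
    mc, tc = mobile.mean(axis=0), target.mean(axis=0)
    h = (mobile - mc).T @ (target - tc)
    u, _, vt = np.linalg.svd(h)
    d = np.sign(np.linalg.det(vt.T @ u.T))  # guard against improper rotations
    rot = vt.T @ np.diag([1.0, 1.0, d]) @ u.T
    return (mobile - mc) @ rot.T + tc


def rmsd(a, b):
    """Root-mean-square deviation between two equally ordered coordinate sets."""
    return float(np.sqrt(((a - b) ** 2).sum(axis=1).mean()))


rng = np.random.default_rng(0)
reference = rng.normal(scale=15.0, size=(200, 3))  # hypothetical Cα positions (Å)

# Five independent perturb-and-refit trials, as in the scheme described above.
refits = [kabsch_fit(random_rigid_perturbation(reference, rng), reference)
          for _ in range(5)]
pairwise = [rmsd(refits[i], refits[j]) for i in range(5) for j in range(i + 1, 5)]
print(f"largest pairwise r.m.s.d.: {max(pairwise):.2e} Å")
```

In a real test the `kabsch_fit` call would be replaced by the density-based
fitting routine, so that the r.m.s.d. reflects how reproducibly the fit
converges in the map rather than the numerical precision of the superposition.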
Extended Data Figure 1 | Cryo-electron microscopic imaging of GluA2 with
ZK200775. a-c, A series of representative images of GluA2 bound by the
competitive antagonist ZK200775 (left panels), with corresponding power
spectra and CTF estimates showing signal beyond 8 Å resolution (right panels,
solid and dotted lines, respectively). Defocus values are 3.7, 2.7 and 3.0 µm for
the three images, respectively. Scale bar, 100 nm.
Extended Data Figure 2 | Antagonist-bound closed state GluA2 density
map quality and resolution. a, b, GluA2em antagonist-bound closed state
density map with coordinates for ATD dimers, LBD dimers, and the
transmembrane domain tetramer independently fit to the map. All coordinates
were derived from PDB 3KG2. In panel b the density map is shown at a higher
contour than a to highlight the closeness of fit between X-ray coordinates
and the density map in the ATD and LBD layers. The density for the ATD-LBD
linker region is weaker than that in the rest of the map and is therefore not
visible at this threshold. The black bounding box in b identifies the M3-helix
bundle crossing visible in the density map. c, Visualization of density map to
highlight variation in resolution across different regions of the map. The
estimated resolution value is colour-coded using the scale shown at the bottom
edge of the panel. d, Expanded versions of selected regions of map. Roman
numerals identify helices 6 and 8, loop 1, and the pre-M1 and M1 helices as
indicated in panels a and b. e, A set of plots that include gold-standard FSC plot
(black line) for the GluA2em antagonist-bound closed state density map
showing a resolution of 10.4 Å at an FSC value of 0.143, and a plot (red line) of
the FSC between the experimentally obtained cryo-EM density map and a
map computed from the fitted coordinates, which displays a resolution of
10.6 Å at an FSC value of 0.5, consistent with the gold-standard FSC curve.
f, Validation of density map using tilt-pair parameter plot. The spread in
orientational assignments around the known goniometer settings is within
~25° for >80% of the selected particle pairs, with clear clustering observed at
the expected location, centred at a distance of 10° from the origin.
[Plot axis labels: Resolution (1/Å); degrees]
Extended Data Figure 3 | Assessment of correspondence between GluA2em
and GluA2cryst. a, b, Density map of antagonist-bound closed state GluA2em
with rigid body fits of GluA2cryst (PDB 3KG2) reveals separation between
the ATD and LBD layers in GluA2em that is absent in GluA2cryst due to deletion
of six residues in the ATD-LBD linker. In a, GluA2cryst fitting was performed
using only ATD tetramer coordinates, which reveals a good fit of the ATD
layer, but at the expense of the closeness of fit of the LBD assembly. Conversely,
in b fitting was performed using only LBD tetramer coordinates, which reveals
a good fit of the LBD layer, but at the expense of the closeness of fit of the
ATD assembly. The black boxes highlight examples of regions where the
mismatches are clearly evident.
[Panel titles: ATD-only fit (a); LBD-only fit (b)]
Extended Data Figure 4 | Open state GluA2 density map quality and 
resolution. a, b, Density map of glutamate-bound GluA2em in the open state 
with coordinates for ATD dimers (PDB 3KG2) and glutamate-bound LBD 
dimers (PDB 1FTJ) fit separately into the map. In panel b the density map is 
shown at a higher contour than a to highlight closeness of fit between X-ray 
domain coordinates and the density map. c, Secondary structural features 
from ATD chains B/D of the density map corresponding to regions marked in 
panel b. Roman numerals identify helices 5, 6, 7 and the ATD lower domain 
β-sheet. d, Gold-standard FSC plot (black line) for the GluA2em open state 
density map showing a map resolution of 12.8 Å at an FSC value of 0.143, and a 
plot (red line) of the FSC between the experimentally obtained cryo-EM density 
map and a map computed from the fitted coordinates, which displays a 
resolution of 12.7 Å at an FSC value of 0.5, consistent with the gold-standard 
FSC curve. 
Extended Data Figure 5 | Desensitized state GluA2 density map classes and 
resolutions. a, Three quisqualate-bound GluA2em desensitized state classes 
resolved through 3D classification. The maps are the same as those presented in 
Fig. 3b, but without segmentation to identify the ATD and LBD regions. 
b, Gold-standard FSC plots for the GluA2em desensitized state density maps 
showing resolutions of 21.4 Å, 25.9 Å and 22.9 Å for classes 1, 2 and 3, 
respectively, at an FSC value of 0.143. 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 




Extended Data Figure 6 | Restored open state density map for the GluA2 
quisqualate-LY451646 complex. a, Density map for the GluA2em open 
state obtained by addition of the allosteric modulator LY451646 to a suspension 
of quisqualate-bound, desensitized GluA2. The purpose of the experiment 
was to test whether structural changes resulting from quisqualate binding to 
generate the desensitized state could be reversed by addition of an excess of the 
allosteric modulator LY451646, used to stabilize the open state. The map 
display shown on the left is colour-coded to highlight variation in resolution 


across different regions of the map. b, Density map for the glutamate-bound 
open state obtained by addition of LY451646 30 min before agonist, as 
shown in Fig. 2. The map display shown on the left is colour-coded as in a to 
highlight variation in resolution across different regions of the map. 
Comparison of the two maps and the fits of ATD and LBD dimers shows 
that they are essentially identical, establishing that the conformational changes 
that occur with desensitization are reversible and can be modulated by allosteric 
modulators. 
Extended Data Figure 7 | Cryo-electron microscopic imaging of GluK2 with 
2S,4R-4-methylglutamate and 2D classes. a, b, Representative cryo-EM 
image of GluK2 bound by the agonist 2S,4R-4-methylglutamate (a), with the 
corresponding image power spectrum and CTF estimate showing signal 
beyond 8 Å resolution (b, solid and dotted lines, respectively). The defocus 
value of the image is 3.7 µm. Scale bar is 100 nm. c, Two-dimensional classes of 
desensitized GluK2 particles subjected to single-particle analysis. d, Gold-standard 
FSC plot (black line) for the GluK2 desensitized state density map 
showing a map resolution of 7.6 Å at an FSC value of 0.143. A plot (red line) of 
the FSC between the experimentally obtained cryo-EM density map and a map 
computed from the fitted coordinates displays a resolution of 7.7 Å at an 
FSC value of 0.5, consistent with the gold-standard FSC curve. 
Extended Data Figure 8 | Resolution of the desensitized GluK2 density 
map. a, GluK2 desensitized state map shown at increasing contour levels from 
left to right, to better highlight selected secondary structural features. 

b, Validation of density map using tilt-pair parameter plot. The spread in 
orientational assignments around the known goniometer settings is within 
~25° for >60% of the selected particle pairs, with clear clustering observed at 
the expected location, centred at a distance of 10° from the origin. c, Distal (left) 
and proximal (right) ATD subunits fit with the corresponding X-ray 


coordinates (PDB 3H6G). d, Proximal (left) and distal (right) LBD subunits fit 
with the corresponding X-ray coordinates for glutamate-bound GluK2 

LBD monomers (PDB 3G3F). The close similarity in density maps for the 
individual ATD and LBD monomers of distal and proximal domains that are 
unrelated by computationally imposed C2 symmetry shows that the LBD 
monomers move largely as rigid bodies and that the structural changes that 
occur with desensitization can be described adequately by rigid body 
movements of the ATD and LBD monomers. 
Extended Data Figure 9 | Comparison between single particle and 
tomographic reconstructions of desensitized GluK2. a, b, Single-particle 
reconstruction of desensitized GluK2 (a) shown adjacent to the previously 
reported structure from subvolume averaging (b). The overall envelope of the 
two structures is the same, but there is a difference in their lengths. This 
difference can be accounted for by considering the effect of the missing wedge 
on the tomographic structure in b. c-f, When the two receptor structures are 
viewed looking down the receptor axis from the extracellular side, ATD layers 
(c, d) and LBD layers (e, f) can be seen to have the same arrangement. The 
ATD layer from the single-particle structure indicates contact within the ATD 
tetramer interface (c), and also between LBD monomeric domains (e). As a 
consequence of the missing wedge in the subtomogram average structure, 
lateral connectivity in the ATD layer (d) and LBD layer (f) is less evident. 
Extended Data Table 1 | Data collection and structural analysis 

Structure                                  Micrographs 
GluA2, ZK200775                            3,751 
GluA2, LY451646 and glutamate              1,566 
GluA2, quisqualate                         888 
GluA2, quisqualate and LY451646            1,303 
GluK2, 2S,4R-4-methylglutamate             4,837 

Structure                                  Dimer Angle 
GluA2em LY451646 Glutamate                 175° 
GluA2 model 1 (Sobolevsky) 
GluA2 model 2 (Dong) 
GluA2 model 3 (4L17) 
GluA2 model 4 (3H6W-A) 
GluA2 model 5 (3H6W-B) 


a, Data collection and image processing statistics (upper table). Results of image processing of GluA2 and GluK2 density maps included in the study. The table shows (from left to right): (1) the number of 
micrographs used for image processing; (2) the number of particles manually designated in the micrographs; (3) the number of particles retained after 2D classification; and (4) the number of particles retained 
after 3D classification and used for structure refinement. b, Comparison of GluA2em active state cryo-EM structure with previous computational models (lower table). A comparison of the GluA2em active state LBD 
tetramer assembly (row 1) with prior active state computational models (rows 2-6) was performed using three different measures: (1) by calculating the angle between dimer assemblies determined using vectors 
from the global centre of symmetry at the level of A665 CA atoms to V395 CA atoms in the proximal (AC) subunits (‘Dimer Angle’); (2) the distance between dimer pairs using proximal (AC) subunit V395 CA 
coordinates at the top of the LBD tetramer assembly (‘V395 Distance’); and (3) the cross-correlation between the GluA2em active state cryo-EM density map and various models (‘Correlation between EM map and 
model’). Model 1 was created as described, by superimposing two copies of GluA2 LBD glutamate complex dimers (PDB 1FTJ) on GluA2cryst, using domain 1 coordinates (r.m.s.d. 0.31 Å). Model 2 was created as 
described, generating a GluA2 LBD tetramer assembly by crystallographic symmetry operations from PDB 1FTJ. Model 3, GluA2 (4L17) is the activation intermediate tetramer assembly generated by 
crystallographic symmetry operations from PDB 4L17, for which the B and D subunit DNQX-bound protomers were replaced by glutamate bound protomers (PDB 1FTJ) as described previously. Models 4 and 5 
are two different glutamate-bound, GluA2 LBD tetramer assemblies generated by crystallographic symmetry operations from PDB 3H6W (ref. 52). Quaternary structural changes were visualized using PyMol. 
Morphs comparing the GluA2em active state LBD tetramer assembly with models 1-3 are shown in Supplementary Video 3. 
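
The ‘Dimer Angle’ measure described above is an angle between two vectors that share an origin at the centre of symmetry. The Python sketch below shows one way such an angle could be computed. The function name and the coordinates are hypothetical placeholders, not atomic positions from any of the PDB entries cited here.

```python
import math


def angle_between(center, p1, p2):
    """Angle in degrees between the vectors center->p1 and center->p2."""
    v1 = [a - c for a, c in zip(p1, center)]
    v2 = [a - c for a, c in zip(p2, center)]
    dot = sum(a * b for a, b in zip(v1, v2))
    n1 = math.sqrt(sum(a * a for a in v1))
    n2 = math.sqrt(sum(a * a for a in v2))
    # Clamp to [-1, 1] to guard against rounding error before acos.
    cos_angle = max(-1.0, min(1.0, dot / (n1 * n2)))
    return math.degrees(math.acos(cos_angle))


# Hypothetical positions: a symmetry centre and two CA atoms placed 175 degrees apart.
centre = (0.0, 0.0, 0.0)
ca_subunit_a = (10.0, 0.0, 0.0)
ca_subunit_c = (10.0 * math.cos(math.radians(175.0)),
                10.0 * math.sin(math.radians(175.0)), 0.0)
dimer_angle = angle_between(centre, ca_subunit_a, ca_subunit_c)
```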
Inefficient star formation in extremely metal poor galaxies 


Yong Shi, Lee Armus, George Helou, Sabrina Stierwalt, Yu Gao, Junzhi Wang, Zhi-Yu Zhang & Qiusheng Gu 


The first galaxies contain stars born out of gas with few or no ‘metals’ 
(that is, elements heavier than helium). The lack of metals is expected 
to inhibit efficient gas cooling and star formation, but this effect 
has yet to be observed in galaxies with an oxygen abundance (relative 
to hydrogen) below a tenth of that of the Sun. Extremely metal poor 
nearby galaxies may be our best local laboratories for studying in 
detail the conditions that prevailed in low metallicity galaxies at early 
epochs. Carbon monoxide emission is unreliable as a tracer of gas at 
low metallicities, and while dust has been used to trace gas in 
low-metallicity galaxies, low spatial resolution in the far-infrared has 
typically led to large uncertainties. Here we report spatially resolved 
infrared observations of two galaxies with oxygen abundances below 
ten per cent of the solar value, and show that stars formed very 
inefficiently in seven star-forming clumps in these galaxies. The 
efficiencies are less than a tenth of those found in normal, metal rich galaxies 
today, suggesting that star formation may have been very inefficient 
in the early Universe. 

The two galaxies that are the focus of this study are Sextans A, a dwarf 
irregular at a distance of 1.4 Mpc with an oxygen abundance of 7% of 
the solar value, and ESO 146-G14, a low-surface-brightness galaxy 
at 22.5 Mpc with a 9% solar oxygen abundance. Their metallicities 
may be similar to that of the gas out of which population II stars formed 
in the early Universe at redshifts around 7 to 12 (ref. 14). An effective 
way to estimate the total gas content in extremely metal poor galaxies 
is to employ spatially resolved maps of the far-infrared-emitting dust 
as tracers of the atomic and molecular gas; this may be done by 
multiplying the dust mass by an appropriate gas-to-dust ratio (GDR) that is 
inferred from regions with little or no active star formation, a 
methodology that has been extensively demonstrated in relatively metal rich 
galaxies. 


Figure 1 | False-colour, multi-wavelength images of our sample galaxies. 
a, Images of Sextans A: red, the sum of Herschel 160 and 250 µm data; 
green, GALEX far-ultraviolet; blue, radio 21 cm data. The star-forming disk is 
defined as the large circle. The small white circles indicate individual dusty 


The infrared observations described here were carried out at 
wavelengths of 70, 160, 250, 350 and 500 µm with the Photodetector Array 
Camera and Spectrometer (PACS) and the Spectral and Photometric 
Imaging REceiver (SPIRE) on board the Herschel Space Observatory. 
We complement our far-infrared data with mid-infrared images from 
the Spitzer Space Telescope to construct the full infrared spectral energy 
distributions (SEDs). Far-ultraviolet images from the GALEX Space 
Telescope archive are used to trace un-obscured star formation. Maps 
of atomic gas are available in the literature for Sextans A and 
ESO 146-G14. 

Figure 1 shows multi-wavelength images of our sample galaxies. We 
defined the star-forming disk as an ellipse to closely follow the 10σ 
(~26 AB mag per arcsec²) contour of the far-ultraviolet emission, as 
shown in Fig. 1 and listed in Table 1. Individual star-forming clumps 
within the disk are identified as circled regions with elevated (>3σ) 
emission relative to local disk backgrounds in both far-ultraviolet and 160 µm 
bands after smoothing images to 28 arcsec resolution. The diffuse 
emission is measured by subtracting the total emission of all star-forming 
clumps in the disk from the integrated disk emission. For Sextans A, we 
also identified several individual diffuse regions that show extended 
emission in the 70 and 160 µm bands but with surface brightness below 3σ 
of local disk backgrounds. In order to derive the dust mass, including 
both diffuse and clumped components, we fitted the infrared SED with 
a standard dust model. The best-fit SEDs are shown in Fig. 2, and 
derived dust masses are listed in Table 1. 

With spatially resolved dust and H I maps, the total gas masses of 
individual star-forming clumps can be derived by multiplying their dust 
masses with the appropriate GDR based on regions with little or no star 
formation. As is usually done for nearby galaxies, the GDR in the 
star-forming clumps is taken as the ratio of atomic gas to dust in the diffuse 




star-forming clumps, while small yellow dashed ellipses indicate individual 
diffuse regions. b, Images of ESO 146-G14; colours as a. The disk is indicated 
as the large ellipse while individual star-forming clumps are shown as 

small circles. 
Table 1 | The properties of the sample 

Region  Right ascension (h:min:s; J2000)  Declination (° ′ ″; J2000)  Major × minor axis* (arcsec; kpc)  Dust mass (M☉)  M_HI/M_dust†  log Σ_gas‡ (log M☉ pc⁻²)  log Σ_SFR§ (log M☉ yr⁻¹ kpc⁻²) 
SextansA/disk 10:11:01.4 —04:41:25 152 x 152; 1.06 x 1.06 (S52 x 103 (6.7708 x 108 
SextansA/sf-1 10:10:56.9 —04:40:27 22 x 22;0.16x0.16 = 9.9+28)x 102 (1.3428) x103 2.26 + 0.23 -2.66+0.2 
SextansA/sf-2 10:11:10.0 —04:41:44 22 X 22; 0.16 x 0.16 (2.0463 x 108 (Li SH eS x 108 2.577021 -—2.77 £0.2 
SextansA/sf-3 10:11:06.2 —04:42:23 32 X 32; 0.22 x 0.22 (1.8404 x 108 (3.2468 x 108 2:21 tp ce 2.32 20.2 
SextansA/sf-4 10:10:55.5 —04:42:59 22 X 22; 0.16 x 0.16 (1.6461 x 108 (4.1438 x 107 2.46+031 —3.19+0.2 
SextansA/diff-1 10:10:53.2 —04:41:43 38 x 20; 0.26 x 0.14 (5.1*28) x 102 (6.92) x 108 
SextansA/diff-2 10:11:09.2 —04:41:02 21 x 14;0.15 x 0.10 (1.8493 x 102 (8.6468 x 108 
SextansA/diff-3 10:10:54.0 —04:40:44 27 X 18; 0.19 x 0.13 (3.2406 x 102 (6.6406 x 103 
SextansA/diffuse (B:lee3 x 103 (1.4404 x 104 
ESO146-G14/disk 22:13:01.3 —62:04:00 90 x 15; 9.34 x 1.56 (5.9402 x 105 (25202 x 103 
ESO146-G14/sf-1 22:13:06.0 —62:03:33 10 x 10; 1.04 x 1.04 (Fiore x 104 (1.6402 x 108 1.214638 —3.46+0.2 
ESO146-G14/sf-2 22:13:02.5 —62:03:52 10 x 10; 1.04 x 1.04 (6.2468 x 104 (2.7282 x 103 1.124028 —3.26+0.2 
ESO146-G14/sf-3 22:12:59.0 —62:04:14 10 x 10; 1.04 x 1.04 (2.7463 x 105 (5.0+3%2 x 102 1,77+631 —3.65 +0.2 
ESO146-G14/diffuse (2.5463 x 105 (4.4462 x 103 
* Major and minor axis lengths are given in arcsec and kpc. 
† The atomic gas to dust mass ratio. The atomic gas includes helium by multiplying H I gas by a factor of 1.36. 
‡ Surface densities of total gas masses for star-forming clumps are derived from their dust masses multiplied by gas-to-dust ratio of the integrated diffuse emission, with inclination correction based on the defined disk ellipse. 
§ Surface densities of SFRs are derived from the combination of infrared and far-ultraviolet tracers, with inclination corrected based on the defined disk ellipse. 


region of the disk. This works because (1) the atomic gas dominates the 
total gas mass in the diffuse regions, and (2) the GDR is roughly 
constant in star-forming disks after removing the metallicity gradients. 
Dwarf galaxies in general show little or no metallicity gradients across 
their disks, including Sextans A, which has a variation of less than 0.1 dex 
(ref. 12). Table 1 lists the derived gas masses of individual star-forming 
clumps corrected for inclination based on the GDR of the integrated 
diffuse emission (GDR = 1.4 × 10⁴ for Sextans A and 4,400 for 
ESO 146-G14). For Sextans A, three individual diffuse regions have similar GDRs 
that are only a factor of 1.5–2 lower than the one derived from the 
integrated diffuse emission. Adopting a GDR of 140 at the solar metallicity, 
our derived GDR values for the diffuse regions of the two galaxies scale 
roughly with metallicity Z as 1/Z^(1.4–1.7). For each star-forming clump, the 
star formation rate (SFR) is estimated by combining the 
far-ultraviolet-based (unobscured) and 24-µm-based (obscured) SFRs. The 
uncertainties in the derived gas masses and SFRs are estimated to be around 
0.3 dex and 0.2 dex, respectively. 
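
The metallicity scaling of the GDR can be checked with a short calculation. The Python sketch below assumes a pure power law, GDR = GDR_solar × (1/Z)^x with Z in solar units, and solves for x using the diffuse-emission GDRs in Table 1 (1.4 × 10⁴ and 4,400), the quoted oxygen abundances (7% and 9% solar) and the adopted solar GDR of 140. The function name is illustrative only.

```python
import math


def gdr_scaling_exponent(gdr, z_solar_units, gdr_solar=140.0):
    """Exponent x in GDR = gdr_solar * (1 / Z)**x, with Z in solar units."""
    return math.log(gdr / gdr_solar) / math.log(1.0 / z_solar_units)


x_sextans_a = gdr_scaling_exponent(1.4e4, 0.07)  # roughly 1.73
x_eso146 = gdr_scaling_exponent(4400.0, 0.09)    # roughly 1.43
```

Under these assumptions both galaxies give an exponent steeper than 1, that is, the GDR rises faster than the inverse of the metallicity.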

Figure 3 shows the distribution of seven dusty star-forming clumps 
in the plane of SFR surface densities versus total gas mass surface 
densities, compared to spirals and merging galaxies. When the dust is used 
to estimate the total gas (filled symbols), the metal-poor star-forming 
clumps appear to have significantly lower star formation efficiencies 
(SFEs) than those found in metal-rich galaxies, or those derived for the 
clumps using the H I gas alone (open symbols). Four extremely metal 
poor clumps in Sextans A show almost two orders of magnitude lower 
SFEs compared to spirals when measured over the similar physical scales. 
This result still holds if we adopt GDR values of three individual dif- 
fuse regions, which causes the gas densities to only drop by 0.2-0.3 dex. 
For ESO 146-G14, one star-forming clump shows significantly (100) 
lower SFEs and the remaining two have SFEs about a factor of 10 lower 
than spirals at sub-kpc scales and similar gas densities. If any dark molec- 
ular gas is present in the diffuse region, the derived SFEs would be even 
lower. For our seven metal poor clumps as a group, the Kolmogorov- 
Smirnov test indicates a probability of only 10 * that their SFRs have 
the same distribution as the SFRs of spiral galaxies at comparable gas 
densities. 
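
For concreteness, the star formation efficiency of a single clump can be worked out from the surface densities in Table 1. The Python sketch below takes SFE = Σ_SFR / Σ_gas, converts kpc⁻² to pc⁻² (1 kpc² = 10⁶ pc²), and takes the depletion time as 1/SFE. It uses the tabulated values for Sextans A sf-1 (log Σ_gas = 2.26, log Σ_SFR = −2.66). The function names are illustrative, and the uncertainties are ignored.

```python
def log_sfe_per_yr(log_sigma_sfr, log_sigma_gas):
    """log10 of SFE in yr^-1.

    log_sigma_sfr: log10 of the SFR surface density in Msun yr^-1 kpc^-2.
    log_sigma_gas: log10 of the gas surface density in Msun pc^-2.
    """
    # Convert the gas surface density to Msun kpc^-2 (add 6 dex) before dividing.
    return log_sigma_sfr - (log_sigma_gas + 6.0)


def log_depletion_time_gyr(log_sfe):
    """log10 of the depletion time in Gyr, taken as 1 / SFE."""
    return -log_sfe - 9.0


log_sfe_sf1 = log_sfe_per_yr(-2.66, 2.26)        # about -10.9
log_tau_sf1 = log_depletion_time_gyr(log_sfe_sf1)  # about 1.9, i.e. roughly 80 Gyr
```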

As illustrated in Fig. 3, the gas masses of individual clumps, derived 
from dust masses, are much higher than the atomic gas masses, 
indicating high molecular gas fractions. By subtracting the observed atomic 
gas from the dust-derived gas mass for our seven star-forming clumps, 
we find that the derived molecular gas mass is on average ~6 times larger 
than the atomic gas mass. For the star-forming clumps, the median and 
standard deviation of the molecular SFE (in units of yr⁻¹) is log(SFE_H2) = 
−10.8 ± 0.6. The log of the corresponding molecular gas depletion 
Figure 2 | Infrared SEDs of individual regions were fitted to derive dust masses. Red symbols are the Spitzer and Herschel photometric points with 1σ error 
bars. The blue solid line indicates the best fit by the dust model. 
Figure 3 | Seven metal poor star-forming clumps show extremely low star 
formation efficiencies. Squares, data for Sextans A (7% solar metallicity, 
region radius 0.15 kpc); circles, data for ESO 146-G14 (9% solar metallicity, 
region radius ~1 kpc). Our data with 1σ errors (filled symbols for dust-based 
total gas mass, while open symbols are for atomic gas only), were compared to 
spiral disks at sub-kpc scales (colour contours), and integrated spirals and 
mergers (black lines) in the plane of the SFR versus gas surface densities. The 
green solid and dashed lines represent predictions of the model’ at 8% solar 
metallicity and a clumping factor of 1, and the model* at 8% solar metallicity. 


Dotted lines indicate constant SFEs (in yr⁻¹) of, from top to bottom, 10⁻⁹, 
10⁻¹⁰, 10⁻¹¹, 10⁻¹². 


time (in Gyr) is given by log(τ_dep^H2) = 1.8 ± 0.6. This is much larger than 
the typical depletion time at sub-kpc scales of local spirals or dwarfs 
with oxygen abundance above 20% solar, which is about log(τ_dep^H2) = 
0.3 (refs 3, 4). Therefore star formation is strongly suppressed in the 
clumps, whether the star formation is scaled by total gas mass or by 
molecular gas mass alone. 

Although models of metallicity-regulated star formation predict 
significantly reduced SFEs at our metallicities, as illustrated in Fig. 3, the 
gas in the models is nevertheless mainly atomic, in contrast with our 
findings. The low SFE in the models is caused by greatly reduced 
formation of molecular gas on the surface of dust grains or may possibly 
be tied to enhanced heating of the atomic gas. On the other hand, if 
we increase the depletion time of molecular gas in the models by an order 
of magnitude to ~20 Gyr, the gas in the model is still mostly atomic 
at our observed gas densities. Studies of H I dominated (by assuming 
low molecular gas fractions) dwarf galaxies suggest that their SFEs are 
not significantly lower than spiral galaxies. Our results extend these 
previous findings to lower metallicity, and suggest high molecular gas 
fractions do exist in star-forming clumps of extremely low metallicity. 

The nature of this excess dust-based gas mass is still unclear. This gas 
is most likely to be in the molecular phase, as the cold H I should be 
detected by 21 cm observations. If it is cold molecular gas, there should 
be associated CO emission. Extended Data Table 6 lists the predicted 
CO flux for these metal poor star-forming clumps assuming the 
CO-to-H2 factor (α_CO) to be 500 M☉ pc⁻² (K km s⁻¹)⁻¹, which is appropriate 
for regions of such low metallicity. It should be noted, however, that 
there is large uncertainty in this value. Region ‘sf-3’ in Sextans A does 
indeed have millimetre-wave observations. Accounting for a filling 
factor of one third of dust emission in the CO beam, we estimate a 3σ 
upper limit to the CO flux that is about a factor of three above our 
predicted value. Therefore much deeper millimetre observations would be 
needed to detect the CO emission from the excess cold molecular gas in 
Sextans A. It is unclear what mechanisms prevent this abundant molecular 
gas from forming new stars. It is possible that the molecular gas does 
not effectively cool due to intense radiation fields, slowing the SFRs 
in these environments. Warm H2 gas with surface densities as high as 
50 M☉ pc⁻² is seen in some blue compact dwarfs. Although our two 
galaxies are not blue compact dwarfs, the SFR surface densities of the 
star-forming regions in our galaxies are comparable to those found in 
such dwarfs. This similarity suggests the possible presence of 
abundant warm H2 in our two extremely metal poor galaxies. Extended Data 
Table 6 also lists the predicted H2 S(1) 17.03 µm line flux based on the 
example of Mrk 996. There are archived Spitzer spectroscopic 
observations of the region ‘sf-3’ of Sextans A. Based on the archived reduced 
data, after accounting for the difference between the Spitzer aperture 
and the size of our ‘sf-3’, the observed H2 17.03 µm flux is about 4 × 
10 !°Wm ”,a factor of two lower than our predicted value. 

The extremely metal poor galaxies may provide a close-up view of 
the highly inefficient star formation occurring in galaxies in the early 
Universe where population II stars formed out of gas whose metallicity 
was 1/10 solar or less. The suppressed SFEs in extremely low 
metallicity galaxies at early epochs may be able to reconcile some tensions 
between observations and theoretical models for early galaxy evolution. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 


Observations and data reduction. Herschel infrared images were generated through 
scan map modes at 70 and 160 µm with PACS, and small map modes at 250, 350, 
and 500 µm with SPIRE (PI: Y. Shi, PID: OT2_yshi_3). The half-power beam widths 
at these wavelengths are about 5″, 12″, 20″, 28″ and 39″, respectively. The mapping 
field size at each band is set to be at least 1.5 times the optical size (D25) of the galaxy, 
where D25 is the B-band isophote at 25 mag arcsec⁻², which is large enough to 
provide blank sky for sky removal. Excluding overheads, the effective integration times 
per sky position in the two PACS bands are 1.9 h and 1.6 h for Sextans A and ESO 
146-G14, respectively, and in three SPIRE bands is 6 min for the two targets. For 
Sextans A at 160 µm, additional Herschel archived data (PI: D. Hunter) with 
similar exposure time to ours were further combined with our own observations. The 
data reduction was performed with Unimap version 5.5, with procedures 
basically following the standard one. To better recover the extended emission, the GLS 
map maker starts with the zero map and the length of median filter for the PGLS 
algorithm is set to be twice the default value (=60). Unimap is a Matlab-based 
Herschel data processing software. Unimap takes as input the level 1 pipeline data as 
produced by Herschel Standard Product Generator (Version 11.1.0 for this work) 
and identifies signals in time ordered pixels. After removing glitch and drift, the final 
maps were made with pixel scales of 1″, 2″, 4″, 6″ and 8″ at 70, 160, 250, 350 and 
500 µm, respectively. 

The reduced far-ultraviolet images were obtained from the GALEX data archive 
hosted by the Multi-Mission archive at the Space Telescope Science Institute. GALEX 
has a spatial resolution of about 5″ in the far-ultraviolet. The exposure times for 
Sextans A and ESO 146-G14 are 1,698 s and 111 s, respectively. Sextans A has reduced 
Spitzer images at 3.6, 4.5, 5.8, 8.0, 24, 70 and 160 µm obtained by the LVL program, 
available in the NASA infrared science archive. The corresponding spatial 
resolutions are about 5″ × λ/24 µm. For ESO 146-G14, the data at 3.6, 4.5, 5.8, 8.0 and 
24 µm were available in the Spitzer Heritage Archive at the Spitzer Science Center 
and the archived enhanced imaging products were used. The Spitzer 70 and 160 µm 
integrated fluxes of ESO 146-G14 were compared with our PACS measurements. 
Infrared flux measurements. Our Herschel flux measurements start with aperture 
definitions followed by sky subtractions. As shown in Extended Data Fig. 1, the 
aperture of the star-forming disk of each galaxy is defined as a circle/ellipse to closely 
follow the 10σ contour of the far-ultraviolet image, corresponding to surface 
brightness levels of 25.9 and 26.2 mag arcsec⁻² (AB magnitude) for Sextans A and ESO 
146-G14, respectively. The results of this study change little if 5σ or 20σ contours 
were used to define the disk aperture. Within each star-forming disk, star-forming 
clumps are defined as circular regions showing both elevated (>3σ) far-ultraviolet 
and infrared emission at 160 µm after convolving the two images to the resolution at SPIRE 
350 µm (28 arcsec). Here σ is the standard deviation of pixel values within the 
star-forming disk. The clump radius is listed in Extended Data Table 1. For Sextans 
A, we also identified three individual diffuse regions that are below 3σ of local 
backgrounds but show extended emission at 70 and 160 µm resolutions. Within the 
disk, infrared point sources that do not have corresponding far-ultraviolet 
counterparts are identified as background sources rather than star-forming regions in the 
disk, since none of the identified clumps has infrared/far-ultraviolet flux ratios 
smaller than about 0.2. 

The sky annuli were defined to be between 1.1 times and 1.5 times the disk aper- 
ture. The mode of the sky pixel brightness distribution is subtracted from the image 
in each case. However, since faint, undetected background sources can make the noise 
distribution non-Gaussian, we also test the validity of our results by subtracting the 
mean value of the sky after masking out bright sources in the sky annuli. 

In total, we derive three types of flux measurements for each region as listed in 
Extended Data Table 1. For the first, referred to as m1, we use the mode of the sky 
brightness after masking off all potential background sources in the disk. For the 
second, referred to as m2, we again subtract the mode of the sky distribution, but 
treat the suspected background sources as embedded star-formation regions in the 
disk. For the third, referred to as m3, we first mask out all potential background 
sources, then subtract the mean of the sky pixels. All potential background sources 
and bright sources within the sky annuli are identified through SExtractor with 
further visual checks. We use the first flux estimates, m1, as the fiducial for the anal- 
ysis presented here. 

For images at each wavelength, aperture photometry was performed after sub- 
tracting the sky background. The aperture corrections were further applied at each 
wavelength based on the corresponding point spread function at that wavelength. 
Diffuse emission within the disk is measured as the residual after subtracting the flux 
from all identified star-forming clumps. The result is presented in Extended Data 
Table 1, where each region has the three measurements detailed above, referred to as the m1, 
m2 and m3 measurements. 

The flux measurements in the Spitzer band were carried out in a similar way to 
the Herschel m1 method and the result is listed in Extended Data Table 2. 
Infrared flux uncertainty estimates. The Herschel flux uncertainties are given by 
the following formula: 

$$\sigma^2 = \sigma^2_{\mathrm{photon,confusion}} \times A_{\mathrm{region}} + \frac{\sigma^2_{\mathrm{photon,confusion}}}{A_{\mathrm{sky}}} \times A^2_{\mathrm{region}} + \sigma^2_{\mathrm{PSF\text{-}offset}} + \sigma^2_{\mathrm{abs\text{-}calibration}} \qquad (1)$$


where A_region and A_sky are the areas of the target region and the sky annulus, respectively.
σ_photon,confusion is the scatter of the sky pixel brightness distribution. Extended Data
Table 3 compares our measured σ_photon,confusion to the predicted photon and confusion
noises estimated using the Herschel Observing Tool (HSPOT) for our targets. The noise in our
images is consistent with the quadratic sum of the HSPOT photon and confusion noise to
within a factor of two. The second term in the above equation gives the scatter of the
derived sky brightness. σ_PSF-offset is the flux uncertainty caused by the accuracy in
positioning an aperture onto a given star-forming clump. For each star-forming region, we
estimated this by randomly offsetting the peak of a modelled point spread function to 1/2
the Nyquist-sampled beam and measuring the flux variation within the given source aperture.
The final term is the absolute flux calibration error, taken to be 20% across all
wavelengths based on the PACS and SPIRE instrument handbooks as well as systematic
comparisons³⁵⁻³⁸ between PACS and MIPS measurements. Extended Data Table 1 lists the
quadratic sum of the first three errors. The final error term is added in quadrature when
doing the SED fitting but is not included in Extended Data Table 1 as it is a systematic
error of the Herschel Space Observatory. Our estimated errors are reasonable compared to
the expected point-source flux errors from HSPOT (also listed in Extended Data Table 3).
Note that although in the SPIRE bands the confusion noise is 2-3 times higher than the
photon noise, this can be mitigated by using a PACS 160 μm prior on the position³⁹. We can
further compare our noise estimates to those from other Herschel observations of similar
depth. For example, the Herschel lensing survey⁴⁰ reported a 1σ point-source depth of
2.4 mJy at 250 μm and 3.4 mJy at 350 μm for an on-source exposure per sky position of
36 min with position priors from short wavelengths, compared to our 2-4 mJy at 250 μm and
3-5 mJy at 350 μm for our 6-10 min on-source exposures.
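
The error budget described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the areas are taken to be in pixels, the sky scatter is per pixel, the calibration term is modelled as a fixed fraction of the source flux, and the function name and all numbers in the usage lines are hypothetical rather than taken from the paper.

```python
import math


def herschel_flux_sigma(sigma_pix, a_region, a_sky, sigma_psf_offset,
                        flux=0.0, cal_fraction=0.0):
    """Combine the error terms of equation (1) in quadrature.

    sigma_pix        : scatter of the sky pixel brightness (flux units per pixel)
    a_region, a_sky  : areas of the source aperture and sky annulus (pixels, assumed)
    sigma_psf_offset : flux error caused by mis-centring the aperture
    flux, cal_fraction : source flux and fractional absolute calibration error
                         (0.2 in the text); leave at 0 to omit this term
    """
    var_aperture = sigma_pix ** 2 * a_region           # photon + confusion noise in the aperture
    var_sky = sigma_pix ** 2 * a_region ** 2 / a_sky   # uncertainty of the subtracted sky level
    var_cal = (cal_fraction * flux) ** 2               # absolute calibration term
    return math.sqrt(var_aperture + var_sky + sigma_psf_offset ** 2 + var_cal)


# Hypothetical numbers, for illustration only.
tabulated = herschel_flux_sigma(0.5, 120.0, 600.0, 1.0)
for_sed_fit = herschel_flux_sigma(0.5, 120.0, 600.0, 1.0, flux=40.0, cal_fraction=0.2)
print(tabulated, for_sed_fit)
```

Calling the function without the calibration arguments mirrors what the text says is tabulated (the first three terms), while passing `cal_fraction=0.2` mirrors the error used for the SED fitting.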

We further carried out additional checks on the measured flux by comparing 
Spitzer and Herschel photometry. For Sextans A, we found that individual star-forming
clumps as well as the diffuse region have 70 μm Herschel fluxes consistent with the
Spitzer measurements to within 30%. The integrated light of Sextans A and ESO146-G14 at
both 70 and 160 μm is also consistent to within 30% between the Spitzer and Herschel
data sets.

To check the possibility that the diffuse emission is due to background fluctuations,
we randomly positioned the source aperture over the observed field of view and compared
the measured fluxes to the quoted error of the target diffuse emission. For ESO 146-G14,
we could randomly position about 30 apertures and found that none of them has S/N larger
than 3 in the bands where the diffuse emission is detected. For Sextans A, the observed
field of view is not large enough for us to perform a similar exercise.
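
A random-aperture test of this kind might look like the following sketch. It assumes a background-subtracted image stored as a list of rows, circular apertures, and a single quoted 1σ error; the function names, the seed and the aperture count default are illustrative choices, not details given in the text.

```python
import random


def aperture_sum(image, x0, y0, radius):
    """Sum the pixels whose centres lie within `radius` of (x0, y0)."""
    total = 0.0
    r2 = radius * radius
    for y in range(max(0, int(y0 - radius)), min(len(image), int(y0 + radius) + 1)):
        row = image[y]
        for x in range(max(0, int(x0 - radius)), min(len(row), int(x0 + radius) + 1)):
            if (x - x0) ** 2 + (y - y0) ** 2 <= r2:
                total += row[x]
    return total


def count_spurious(image, radius, quoted_error, n_apertures=30, threshold=3.0, seed=0):
    """Count randomly placed apertures whose flux exceeds threshold * quoted_error."""
    rng = random.Random(seed)
    ny, nx = len(image), len(image[0])
    hits = 0
    for _ in range(n_apertures):
        # Keep the whole aperture inside the image.
        x0 = rng.uniform(radius, nx - 1 - radius)
        y0 = rng.uniform(radius, ny - 1 - radius)
        if aperture_sum(image, x0, y0, radius) / quoted_error > threshold:
            hits += 1
    return hits
```

If the diffuse emission were a background fluctuation, a non-negligible fraction of the random apertures would be expected to exceed the threshold; a count of zero corresponds to the outcome reported for ESO 146-G14.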

Infrared SED fitting and dust mass measurements. We fit the infrared data with dust
models in order to estimate the dust mass of each region. As described above, we have
three types of flux measurements, and we fit all three with the dust model. We choose a
Milky Way grain size distribution and fix the PAH fraction to the minimum (the fraction
of the total dust mass that is in PAHs, q_PAH, is 0.47%) given the low metallicity (the
result does not change if this parameter is set free). To further check the effect of
different dust grains, for the first type of flux measurements (m1), SMC and LMC dust
grains that have different grain compositions and size distributions are also explored.
Overall we thus have five dust mass measurements for each region; three of them are for
the different types of flux measurements with Milky Way grains (referred to as m1-MW,
m2-MW and m3-MW), and two are for the two other grain models fitted to the first type of
flux measurements (m1-SMC and m1-LMC).

In the following we take the m1-MW as an example to illustrate the fitting procedure.
The results are plotted in Fig. 2 and listed in Extended Data Table 4. To do the fit, a
4,000 K black-body spectrum was first added to represent the emission from stellar
photospheres, which dominates at <10 μm. The model was then left with four free
parameters: the dust mass, the minimum (U_min) and maximum (U_max) intensity of the
stellar radiation field that is responsible for heating the dust, and the fraction
(1 − γ) of dust exposed to the minimum starlight intensity (that is, U_min). Similar to
studies of dust emission in spirals, U_max was further fixed to a maximum of 10⁶. We then
performed SED fitting with three free parameters for Sextans A and ESO 146-G14.

As listed in Extended Data Table 4, the reduced χ² for the majority of the fits has
values of around unity, while sf-1, sf-2 and diff-3 of Sextans A have a reduced χ² of
around 10. As shown in Fig. 2, the 160 μm photometry of sf-1 and diff-3 shows large
deviations from the best fit, while the 24 and 70 μm photometry of sf-2 deviates
substantially from the best fit. Uncertainties in the derived dust mass were estimated
by performing 100 fits to each source after adding in Gaussian noise.

We also carried out simple modified black-body fitting to the infrared SEDs of Sextans A
and ESO 146-G14 that have enough far-infrared photometric data points. As listed in
Extended Data Table 4, the dust temperature of star-forming clumps is between 30 and
50 K while the dust in the diffuse region is about 30-40 K.
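
The two steps above (a modified black-body fit, and error estimation from 100 noise-perturbed refits) can be illustrated together. The sketch below is a stand-in, not the authors' procedure: it fits a single-temperature modified black body by a grid search in temperature with an analytic amplitude, assumes a fixed emissivity index β = 2, and applies the Monte Carlo step to the fitted temperature rather than to the dust-model mass. All function names and the grid are hypothetical.

```python
import math
import random
import statistics

H_OVER_K_THZ = 47.99243   # h/k in kelvin per THz
C_UM_THZ = 299.792458     # speed of light in micron * THz


def mbb_shape(wavelength_um, temperature_k, beta=2.0):
    """Unnormalised modified black body: nu**beta * B_nu(T), with nu in THz."""
    nu = C_UM_THZ / wavelength_um
    return nu ** (3.0 + beta) / math.expm1(H_OVER_K_THZ * nu / temperature_k)


def fit_mbb(wavelengths_um, fluxes, errors, t_grid, beta=2.0):
    """Grid search in temperature; the amplitude is solved analytically."""
    best = None
    for t in t_grid:
        model = [mbb_shape(w, t, beta) for w in wavelengths_um]
        num = sum(f * m / e ** 2 for f, m, e in zip(fluxes, model, errors))
        den = sum((m / e) ** 2 for m, e in zip(model, errors))
        amp = num / den
        chi2 = sum(((f - amp * m) / e) ** 2 for f, m, e in zip(fluxes, model, errors))
        if best is None or chi2 < best[2]:
            best = (t, amp, chi2)
    return best  # (temperature, amplitude, chi-squared)


def monte_carlo_temperature(wavelengths_um, fluxes, errors, t_grid,
                            n_trials=100, seed=0):
    """Refit after adding Gaussian noise; return mean and scatter of T."""
    rng = random.Random(seed)
    temps = []
    for _ in range(n_trials):
        noisy = [f + rng.gauss(0.0, e) for f, e in zip(fluxes, errors)]
        temps.append(fit_mbb(wavelengths_um, noisy, errors, t_grid)[0])
    return statistics.mean(temps), statistics.pstdev(temps)
```

The scatter returned by `monte_carlo_temperature` plays the role of the quoted 1σ uncertainty; the same perturb-and-refit loop applies unchanged to any other fitted parameter, such as a dust mass.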

Measurements of SFR and gas mass surface densities. The SFRs of star-forming clumps were
measured by combining the far-ultraviolet and 24-μm data, which represent the unobscured
and obscured star formation, respectively. The derived SFRs are uniformly assigned a
0.2 dex error to account for the systematic errors in deriving SFRs from ultraviolet and
infrared photometry (the photon noise is comparatively small). The SFR surface density
is further corrected for inclination based on the major-to-minor axis ratio of the
defined disk. The final result is listed in Table 1.

With the derived dust masses, we estimated the GDR of the diffuse region as the ratio of
atomic gas mass to dust mass. The GDR of the integrated diffuse region is then applied
to individual star-forming clumps to derive the total gas mass and thus the gas mass
surface density (Σ_gas). Extended Data Table 5 lists the results for the five fits:
m1-MW, m2-MW, m3-MW, m1-SMC and m1-LMC. The associated uncertainties of Σ_gas are the
quadratic sum of the errors of the dust mass measurements, the errors of the GDRs of the
diffuse regions (contributed by uncertainties on the H I and dust mass estimates of the
diffuse regions) and 0.2 dex for the GDR variation across the disk based on studies of
spiral galaxies.
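
As a worked illustration of this bookkeeping, the sketch below derives a gas surface density from a clump dust mass and the diffuse-region GDR, and combines the error terms in quadrature in dex. It assumes that fractional errors are converted to dex with the usual first-order relation σ_log = σ_x / (x ln 10); the function names and all input numbers are hypothetical.

```python
import math


def gas_surface_density(m_dust_clump, m_hi_diffuse, m_dust_diffuse, area_pc2):
    """Sigma_gas = GDR(diffuse) * M_dust(clump) / area, in solar masses per pc^2."""
    gdr = m_hi_diffuse / m_dust_diffuse
    return gdr * m_dust_clump / area_pc2


def frac_to_dex(frac_err):
    """First-order conversion of a fractional error to an error in log10."""
    return frac_err / math.log(10.0)


def sigma_gas_error_dex(err_dust_clump_dex, err_gdr_dex, gdr_scatter_dex=0.2):
    """Quadratic sum of the three error terms named in the text."""
    return math.sqrt(err_dust_clump_dex ** 2 + err_gdr_dex ** 2 + gdr_scatter_dex ** 2)


# Hypothetical inputs, for illustration only.
sigma_gas = gas_surface_density(100.0, 1.0e6, 1.0e3, 1.0e4)
error_dex = sigma_gas_error_dex(frac_to_dex(0.5), frac_to_dex(0.3))
print(math.log10(sigma_gas), error_dex)
```

Because the 0.2 dex GDR scatter enters in quadrature, it sets a floor on the total uncertainty however well the dust masses are measured.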

The result of the m1-MW fit is used as the fiducial, as listed in Table 1 and shown in
Fig. 3. Our conclusion of significantly reduced SFEs in seven metal-poor star-forming
clumps changes little if other fitting results in Extended Data Table 5 are adopted. In
addition, there are some concerns that PACS may miss some extended emission, although
this is not seen in our comparisons to Spitzer or in investigations by others. To test
this effect, we artificially increased the PACS fluxes of the diffuse emission by 30%
while keeping the SPIRE fluxes as they were; the resulting gas mass surface densities of
star-forming clumps drop by no more than 0.1 dex. In addition, three individual diffuse
regions of Sextans A have similar GDRs, only a factor of 1.5-2 lower than that of the
integrated diffuse emission, indicating that our GDR estimate is reasonable.

We investigate whether the derived Σ_gas can be significantly lowered by forcing changes
in the dust model parameters, specifically by raising U_min, which can result in lower
dust masses and hence lower gas surface densities and higher SFEs. In the following
discussion, we take the m1-MW fit as the case study. For both targets, the best-fit
U_min of all regions is relatively small. We thus keep the best-fit U_min for the diffuse
region but gradually increase the U_min of the star-forming clumps to decrease their
Σ_gas. We find that the star-forming clumps in Sextans A can move into the spiral galaxy
regime of Fig. 3 if U_min rises above 20. However, in this case the corresponding χ²
rises to 40-60. For star-forming clumps in ESO 146-G14, U_min needs to be larger than 15
to move into the spiral regime; however, these fits are again poor, with χ² values of
10-30. Therefore the significantly reduced SFEs of star-forming clumps in Sextans A and
ESO 146-G14 should be robust to changes in their U_min.


29. Traficante, A. et al. Data reduction pipeline for the Hi-GAL survey. Mon. Not. R. Astron. Soc. 416, 2932-2943 (2011).

30. Piazzo, L. et al. Artifact removal for GLS map makers by means of post-processing. IEEE Trans. Image Process. 21, 3687-3696 (2012).

31. Morrissey, P. et al. The calibration and data products of GALEX. Astrophys. J. 173 (Supp.), 682-697 (2007).

32. Dale, D. A. et al. The Spitzer Local Volume Legacy: survey description and infrared photometry. Astrophys. J. 703, 517-556 (2009).

33. Engelbracht, C. W. et al. Metallicity effects on dust properties in starbursting galaxies. Astrophys. J. 678, 804-827 (2008).

34. Bertin, E. et al. SExtractor: Software for source extraction. Astron. Astrophys. Suppl. 117, 393-404 (1996).

35. Sauvage, M. Experiments in photometric measurements of extended sources. http://herschel.esac.esa.int/twiki/pub/Public/PacsCalibrationWeb/ExtSrcPhotom.pdf (2011).

36. Ali, B. Surface brightness comparison of PACS blue array with IRAS and Spitzer/MIPS images. https://nhscsci.ipac.caltech.edu/pacs/docs/Photometer/PICC-HSC-TN-029.pdf (2011).

37. Paladini, R. et al. Assessment analysis of the extended emission calibration for the PACS red channel. https://nhscsci.ipac.caltech.edu/pacs/docs/Photometer/PICC-NHSC-TR-034.pdf (2012).

38. Paladini, R. et al. PACS map-making tools: analysis and benchmarking. http://herschel.esac.esa.int/twiki/pub/Public/PacsCalibrationWeb/pacs_mapmaking_report_ex_sum_v3.pdf (2013).

39. Elbaz, D. et al. GOODS-Herschel: an infrared main sequence for star-forming galaxies. Astron. Astrophys. 533, A119 (2011).

40. Egami, E. et al. The Herschel Lensing Survey (HLS): Overview. Astron. Astrophys. 518, L12 (2010).


©2014 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 1 | Multi-wavelength images of the two galaxies. a, Images of Sextans A in (left to right) the far-ultraviolet, H I gas, 70 μm, 160 μm and 250 μm dust emission. The large circle is the star-forming disk, small circles are star-forming clumps, and ellipses are diffuse regions. b, Images of ESO 146-G14: wavebands and disks/ellipses as in a.

Extended Data Table 1 | PACS and SPIRE photometry for the selected regions

region              Right ascension  Declination  sizes (ma, mb)  f(70 μm)   f(160 μm)  f(250 μm)  f(350 μm)  f(500 μm)
                    (J2000)          (J2000)      (arcsec)        (mJy)      (mJy)      (mJy)      (mJy)      (mJy)
SextansA/disk       10 11 01.4       -04 41 25    152.0x152.0     636±16     1024±18    644±30     308±23     124±18
                                                                  658±16     1098±18    722±30     356±23     155±18
                                                                  605±16     979±18     557±30     236±23     78±18
SextansA/sf-1       10 10 56.9       -04 40 27    22.5x22.5       40±2       56±7       55±3       32±3       16±3
                                                                  40±2       56±7       55±3       32±3       16±3
                                                                  39±2       55±7       53±3       30±3       14±3
SextansA/sf-2       10 11 10.0       -04 41 44    22.5x22.5       12±3       147±18     111±4      52±4       24±3
                                                                  12±3       147±18     111±4      52±4       24±3
                                                                  …±3        146±18     109±4      50±4       22±3
SextansA/sf-3       10 11 06.2       -04 42 23    32.3x32.0       265±4      296±24     164±5      89±4       33±3
                                                                  265±4      296±24     164±5      89±4       33±3
                                                                  …±4        294±24     160±5      85±4       30±3
SextansA/sf-4       10 10 55.5       -04 42 59    22.5x22.5       20±2       69±8       62±3       34±3       18±3
                                                                  20±2       69±8       62±3       34±3       18±3
                                                                  20±2       68±8       60±3       31±3       16±3
SextansA/diff-1     10 10 53.2       -04 41 43    38.0x20.0       75±3       85±5       47±4       27±3       <13
                                                                  75±3       85±5       47±4       27±3       <13
                                                                  74±3       83±5       43±4       23±3       <13
SextansA/diff-2     10 11 09.2       -04 41 02    21.4x14.6       30±2       45±6       18±3       <10        <12
                                                                  30±2       45±6       18±3       <10        <12
                                                                  30±2       44±5       16±3       <10        <12
SextansA/diff-3     10 10 54.0       -04 40 44    27.5x18.5       44±2       52±5       36±4       14±3       <13
                                                                  44±2       52±5       36±4       14±3       <13
                                                                  44±2       …          33±4       …±3        <13
SextansA/diffuse                                                  237±18     453±39     248±33     99±26      <69
                                                                  258±18     527±39     326±33     147±26     <69
                                                                  210±18     414±39     …±33       <78        <69
ESO146-G14/disk     22 13 01.3       -62 04 00    90.0x15.0       110±3      241±5      148±7      81±7
                                                                  110±3      241±5      148±7      81±7
                                                                  …          …          …          …
ESO146-G14/sf-1     22 13 06.0       -62 03 33    10.0x10.0       28±4       38±6       29±4       17±3
                                                                  28±4       38±6       29±4       17±3
                                                                  28±4       37±6       28±3       16±3
ESO146-G14/sf-2     22 13 02.5       -62 03 52    10.0x10.0       36±5       52±8       28±3       12±3
                                                                  36±5       52±8       28±3       12±3
                                                                  36±5       …          …          …
ESO146-G14/sf-3     22 12 59.0       -62 04 14    10.0x10.0       15±2       57±9       49±6       31±5
                                                                  15±2       57±9       49±6       31±5
                                                                  15±…       57±8       49±…       31±…
ESO146-G14/diffuse                                                30±8       93±14      41±11      <31
                                                                  30±8       93±14      41±11      <31
                                                                  30±8       91±14      …±11       <31

For each region, at each wavelength, we give three types of flux measurements (top to bottom; m1, m2 and m3, see text and Methods). The 1σ flux errors are the quadratic sum of photon and confusion noise, scatter
of the sky brightness, and uncertainties in the flux due to mis-centring of extraction apertures. The 3σ upper limits are given where appropriate. The uncertainties in the absolute flux calibration are not included
here, but are added in quadrature before performing the SED fitting as described in the text.

Extended Data Table 2 | Spitzer photometry

region              f(3.6 μm)     f(4.5 μm)     f(5.8 μm)     f(8.0 μm)    f(24 μm)
                    (mJy)         (mJy)         (mJy)         (mJy)        (mJy)
SextansA/disk       255.33±0.06   157.29±0.05   108.28±0.26   59.46±0.23   28.98±3.00
SextansA/sf-1       1.67±0.01     1.05±0.01     <0.12         0.62±0.04    1.09±0.14
SextansA/sf-2       1.68±0.01     1.36±0.01     0.70±0.04     0.48±0.04    3.25±0.34
SextansA/sf-3       2.85±0.01     2.20±0.01     1.14±0.06     0.60±0.05    6.36±0.65
SextansA/sf-4       1.17±0.01     0.69±0.01     0.34±0.04     0.28±0.04    0.97±0.13
SextansA/diff-1     1.60±0.01     1.01±0.01     <0.15         0.81±0.04    2.27±0.25
SextansA/diff-2     11.96±0.01    7.43±0.01     5.83±0.03     2.81±0.03    1.11±0.13
SextansA/diff-3     0.98±0.01     0.78±0.01     <0.12         0.93±0.04    1.33±0.16
SextansA/diffuse    247.96±0.06   151.99±0.05   106.00±0.29   57.47±0.26   17.31±3.11
ESO146-G14/disk     4.67±0.01     3.05±0.02     3.00±0.08     3.00±0.08    3.87±0.66
ESO146-G14/sf-1     0.33±0.00     0.24±0.00     0.25±0.02     0.29±0.02    1.03±0.16
ESO146-G14/sf-2     0.38±0.00     0.28±0.00     0.35±0.02     0.29±0.02    1.23±0.18
ESO146-G14/sf-3     0.74±0.00     0.50±0.00     0.44±0.02     0.49±0.02    0.92±0.16
ESO146-G14/diffuse  3.22±0.01     2.03±0.02     1.96±0.08     1.92±0.08    <2.17


Spitzer photometric measurements were performed in a similar way to the Herschel m1 method.

Extended Data Table 3 | Measured sky noises of our observations compared to predictions by HSPOT

galaxy/band         Extended source                                        Point sources
                    σ_measured-sky  σ_HSPOT,photon  σ_HSPOT,confusion      σ_HSPOT,photon  σ_HSPOT,confusion
                    (MJy/sr)        (MJy/sr)        (MJy/sr)               (mJy)           (mJy)
SextansA/70μm       2.86            2.03            0.22                   0.52            0.08
SextansA/160μm      1.20            0.92            0.74                   0.83            1.34
SextansA/250μm      0.93            0.24            1.19                   2.86            7.0
SextansA/350μm      0.49            0.11            0.67                   2.38            8.2
ESO146-G14/70μm     1.82            1.53            0.20                   0.60            0.08
ESO146-G14/160μm    1.10            0.99            0.74                   1.33            1.33
ESO146-G14/250μm    0.75            0.24            1.18                   2.86            7.0
ESO146-G14/350μm    0.46            0.11            0.67                   2.38            8.1


HSPOT, Herschel observation planning tool. See Methods for details of parameters given here.

Extended Data Table 4 | Fitting results


region Umin Umax(fixed) x7/ dof = Maust My! Maust Taust 
(Mo) (K) 

SextansA/disk 2.0 106 0.01 1.31 95 F1.g)x108 (5.770°5)x10° 33+ 1 
SextansA/sf-1 i2 10° 0.00 900 t28 2)x10?_ (1 3H °)x103 4547 
SextansA/sf-2 12 0° 000 1441 @2? ok 10° «(1 38 Dx 10° 2842 
SextansA/sf-3 4.0 108 0.00 2.87 (1 att 3)x10° (3 8: $)x10° 38+ 3 
SextansA/sf-4 0.7 108 0.00 2.21 qd ort 110° (4. 178 B10? 2742 
SextansA/diff-1 40 10° 0.01 207 (5. 38: 102 (6. gif: 8)102 40+ 4 
SextansA/diff-2 5.0 106 0.00 0.14 (1 gt °3)x10? (8.679 )x108 30+ 8 
SextansA/diff-3 40 106 0.01 720 © 2i8 x10? (6.6*5'9)x10° 3746 
SextansA/diffuse 25 108 0.01 0.05 (3. #8) “$)x108 —(1.4491)x104 29+ 3 
ESO146-G14/disk 1.5 10° 0.01 413 (5 ott 2)x10® (2.5*9-2)x108 3041 
ESO146-G14/sf-1 2.5 108 0.01 3.45 7 ste jyxt0" (1.640:2)x103 44+ 12 
ESO146-G14/sf-2 40 106 0.01 0.19 (6 40: eyx10* 12: 7 3108 3145 
ESO146-G14/sf-3 0.7 ~~ -:108 0.01 365 (2 78: 105 (5.12:2)x102 28+. 4 
ESO146-G14/diffuse 0.7 10° 0.12 17 # @: 50: 4%, 10° (4 ats 2)x103 2547 


Key derived parameters from fitting the dust model to the m1 flux measurements of Extended Data Tables 1 and 2. In addition to the flux errors reported in Extended Data Table 1, the uncertainties in the absolute
flux calibration were added before performing the fits, as described in the text. The last column is the dust temperature as given by modified black-body fitting.

Extended Data Table 5 | Gas mass surface densities given by models of different dust types


region logy logue. tall logyras logos 
(logM3/pe”) (logM_5/pe?) (logM.,/pe*) (logM./pe”) (logM./pe?) 
SextansA/sf-1 226 a 210 co I. Ca) Goes Lio 
SextansA/sf-? 25 oa 240'T a | bi 2a DGS as 
SextansA/sf-3 221 205i 135e= 225182 2312 
SextansA/sf-4 2G a eer pe ee ps gor pe i= 
ESO146-G14/sf-1 1.214938 120 | ie t- Lincs Liss 


ESO146-Gl4sf-2 1.1272 11738 116% joo#2 11612 
FSO146-G14/sf-3 1.77422! 1.774022, 1 got = 1 gnt822 1 gn +022 


Gas surface densities Σ_gas were derived from dust masses based on infrared SED fitting by dust models of Milky Way (MW), Small Magellanic Cloud (SMC) and Large Magellanic Cloud (LMC) grains.

Extended Data Table 6 | Predicted CO and warm H2 line fluxes

region              I_CO          f_H2 (S(1), 17.035 μm)
                    (K km/s)      (W m⁻²)
SextansA/sf-1       0.33          2.0E-17
SextansA/sf-2       0.67          1.5E-16
SextansA/sf-3       0.25          6.1E-16
SextansA/sf-4       0.56          1.1E-16
ESO146-G14/sf-1     0.02          4.2E-18
ESO146-G14/sf-2     0.01          1.1E-17
ESO146-G14/sf-3     0.10          5.4E-18


Binary orbits as the driver of γ-ray emission and
mass ejection in classical novae


Laura Chomiuk, Justin D. Linford, Jun Yang, T. J. O'Brien, Zsolt Paragi, Amy J. Mioduszewski, R. J. Beswick,
C. C. Cheung, Koji Mukai, Thomas Nelson, Valério A. R. M. Ribeiro, Michael P. Rupen, J. L. Sokoloski,
Jennifer Weston, Yong Zheng, Michael F. Bode, Stewart Eyres, Nirupam Roy & Gregory B. Taylor


Classical novae are the most common astrophysical thermonuclear explosions, occurring on
the surfaces of white dwarf stars accreting gas from companions in binary star systems¹.
Novae typically expel about 10⁻⁴ solar masses of material at velocities exceeding 1,000
kilometres per second. However, the mechanism of mass ejection in novae is poorly
understood, and could be dominated by the impulsive flash of thermonuclear energy²,
prolonged optically thick winds³ or binary interaction with the nova envelope⁴. Classical
novae are now routinely detected at gigaelectronvolt γ-ray wavelengths⁵, suggesting that
relativistic particles are accelerated by strong shocks in the ejecta. Here we report
high-resolution radio imaging of the γ-ray-emitting nova V959 Mon. We find that its
ejecta were shaped by the motion of the binary system: some gas was expelled rapidly
along the poles as a wind from the white dwarf, while denser material drifted out along
the equatorial plane, propelled by orbital motion⁶,⁷. At the interface between the
equatorial and polar regions, we observe synchrotron emission indicative of shocks and
relativistic particle acceleration, thereby pinpointing the location of γ-ray
production. Binary shaping of the nova ejecta and associated internal shocks are
expected to be widespread among novae, explaining why many novae are γ-ray emitters.

The identification of the γ-ray transient J0639+0548, detected by NASA's Fermi Gamma-ray
Space Telescope, with the classical nova V959 Mon was a surprise, because
gigaelectronvolt γ-rays are produced by the inverse Compton mechanism, the pion
production mechanism or both, requiring a population of relativistic particles which had
not been predicted or observed in normal classical novae. Gigaelectronvolt γ-rays had
been reported from only one nova before V959 Mon, in a system with an unusual Mira giant
companion, dense circumbinary material and, thereby, a strong shock interaction between
the nova ejecta and the surroundings. The white dwarf in V959 Mon, however, has a
main-sequence companion and, therefore, a low-density circumbinary environment, and so
there is no apparent mechanism for diffusive shock acceleration in an interaction with
surrounding material.

The γ-ray emission from V959 Mon was discovered on 2012 June 19 (day 0) and lasted
~12 d, showing a soft-spectrum continuum. Little is known about V959 Mon during the
period of γ-ray emission, owing to its solar conjunction in the first few months of
outburst, which prevented optical observations; the transient was not even identified as
a nova until 56 d after γ-ray discovery. However, we obtained early radio observations
coincident with the Fermi detections using the Karl G. Jansky Very Large Array (VLA),
just 12 and 16 d after discovery (Fig. 1). These observations span a frequency range of
1-6 GHz and show a flat radio spectrum (α ≈ −0.1, where S_ν ∝ ν^α, ν is the observing
frequency and S_ν is the flux density at this frequency). This spectral index is much
more consistent with synchrotron radiation than the expected optically thick emission
from warm nova ejecta (α ≈ 2 is predicted and observed in V959 Mon at later times;
Fig. 1 and Extended Data Fig. 1).
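
For a power-law spectrum, the spectral index follows directly from flux densities at two frequencies. The short Python sketch below shows the arithmetic; the function name and the flux values are hypothetical and are not measurements from this work.

```python
import math


def spectral_index(nu1_ghz, s1_mjy, nu2_ghz, s2_mjy):
    """Two-point spectral index alpha, defined by S_nu proportional to nu**alpha."""
    return math.log(s2_mjy / s1_mjy) / math.log(nu2_ghz / nu1_ghz)


# Hypothetical flux densities at 1 and 6 GHz, for illustration only.
flat = spectral_index(1.0, 10.0, 6.0, 10.0 * 6.0 ** -0.1)   # synchrotron-like, nearly flat
thick = spectral_index(1.0, 1.0, 6.0, 36.0)                 # optically thick thermal slope
print(flat, thick)
```

Over the 1-6 GHz range, α ≈ −0.1 corresponds to a flux density that changes by less than 20%, whereas α ≈ 2 corresponds to a factor of 36, which is why the two cases are easy to tell apart.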

Like gigaelectronvolt γ-rays, synchrotron emission requires a population of relativistic
particles, and so we can use this radio emission as a tracer of γ-ray production that
lasts longer and enables much higher spatial resolution than do the γ-rays themselves.
The location of the γ-ray-producing shocks was revealed by milliarcsecond-resolution
radio imaging using very-long-baseline interferometric (VLBI) techniques, which are
sensitive to high-surface-brightness synchrotron emission. VLBI observations were
achieved with the European VLBI Network (EVN) and the Very Long Baseline Array, and
spanned 2012 September 18 to October 30 (91-133 d after γ-ray discovery, 2-7 mas
resolution; where one milliarcsecond (1 mas) ≈ 2 × 10¹³ cm at the distance of V959 Mon;
Extended Data Table 3). The first VLBI epoch revealed two distinct knots of emission
separated by 36 mas, which we subsequently observed to travel away from one another at
an estimated rate of ~0.4 mas d⁻¹ (Fig. 2a). In addition, a third radio component
appeared in our imaging from day 113. The brightest component was slightly resolved by
the Very Long Baseline Array on day 106 (Extended Data Fig. 2), and had a peak
brightness temperature, ~2 × 10⁶ K, indicative of non-thermal emission (X-ray
observations from around this time imply that hot shocked thermal gas can account for
only <10% of the radio flux density seen in the VLBI knots; Supplementary Information).
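
The angular-to-physical conversion quoted above is simple small-angle arithmetic, sketched here in Python. The distance of 1.5 kpc is the value assumed for the scale bars in the Fig. 2 caption; the function names are hypothetical, and the resulting velocity is a plane-of-sky estimate from this arithmetic, not a number quoted in the text.

```python
import math

CM_PER_KPC = 3.0857e21
RAD_PER_MAS = math.pi / (180.0 * 3600.0 * 1000.0)
SECONDS_PER_DAY = 86400.0


def mas_to_cm(distance_kpc, angle_mas=1.0):
    """Projected length subtended by `angle_mas` at `distance_kpc` (small-angle limit)."""
    return angle_mas * RAD_PER_MAS * distance_kpc * CM_PER_KPC


def proper_motion_to_km_s(rate_mas_per_day, distance_kpc):
    """Plane-of-sky speed corresponding to an angular rate in mas per day."""
    return mas_to_cm(distance_kpc, rate_mas_per_day) / SECONDS_PER_DAY / 1.0e5


scale_cm = mas_to_cm(1.5)                       # length of 1 mas at 1.5 kpc
separation_speed = proper_motion_to_km_s(0.4, 1.5)
print(scale_cm, separation_speed)
```

At 1.5 kpc this gives about 2.2 × 10¹³ cm per mas, consistent with the rounded value in the text, and a knot separation rate of 0.4 mas d⁻¹ corresponds to roughly 1,000 km s⁻¹ between the two knots, or about half that for each knot relative to the midpoint.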

Observations made with the e-MERLIN array (54 mas resolution) just before the first VLBI
epoch show that the compact VLBI components were embedded in a larger-scale structure
which was mostly extended east-west (and not detected in the VLBI imaging, because these
high-resolution arrays with widely separated antennas are not sensitive to such diffuse
emission; Fig. 2b). This diffuse emission is interpreted as bremsstrahlung from the bulk
of the nova ejecta. Whereas the 5 GHz flux density detected in our VLBI imaging was
roughly constant or declined with time, the flux density detected in the
lower-resolution observations rapidly increased during this period (Fig. 1). The VLBI
knots comprised 19% and 9% of the total 5 GHz flux density on days 91 and 117,
respectively, implying that over time the synchrotron emission becomes overwhelmed by
thermal emission from the warm ejecta. Although synchrotron radio emission has been
detected from outbursting novae with red giant companions and dense circumbinary
material, it has

Figure 1 | Radio light curves and spectra of V959 Mon. a, The multi-frequency light curve
(Extended Data Tables 1 and 2) is as expected for expanding thermal ejecta, except at the
earliest times (<30 d) and lowest frequencies (<2 GHz; Supplementary Information). The
time of optical identification is marked (hatched line). The times corresponding to
three select radio spectra are marked with grey bars. b-d, Early-time flat spectrum
(16 d after Fermi discovery; b), transitioning to an optically thick thermal spectrum
(day 145; c), and a late thermal spectrum that is mostly optically thin (day 542; d).
Error bars denote 1σ uncertainties. Downward-facing arrows denote 3σ upper limits.


Figure 2 | Radio imaging of V959 Mon. a, Illustration of the expansion of the compact
radio knots, with a high-resolution 5 GHz EVN image from 113 d after γ-ray discovery
shown in colour, and contours representing the EVN image from day 91. Contour levels span
0.125-2 mJy per beam in steps of 0.125 mJy per beam. b, 5.8 GHz e-MERLIN colour image of
thermal nova ejecta on day 87. The compact VLBI knots from day 91 are superimposed as
contours, with levels as in a. c, Components similar to those in b, but one month later,
with a 36.5 GHz VLA colour image of the thermal ejecta on day 126. The VLBI knots from
day 113 are contours. d, Expansion and flip of the thermal nova ejecta, comparing VLA
images from four months and, respectively, two years after outburst. The 17.5 GHz image
from day 615 is shown in colour, overplotted with the 36.5 GHz day-126 image now in black
contours (3.2, 9.6, 16, 22.4 and 28.8 mJy per beam). In all panels, the presumed location
of the binary is marked as a white cross. Scale bars in b and d assume a distance of
1.5 kpc (ref. 11). Synthesized beams for contours are shown in the bottom left corners of
a and d. White date labels are for colour images; grey labels are for contours.

Figure 3 | Simple illustration of the 2012 outburst of V959 Mon. a, Immediately following
the thermonuclear runaway, the nova envelope expands (yellow ellipse) and interacts with
the binary system, yielding dense material in the equatorial plane (darker yellow; here
orientated vertically). Denser bow shocks surround this puffy equatorial disk (orange
lines). b, As the nova outburst progresses, the white dwarf powers a fast wind that is
funnelled towards the low-density poles (blue cones; compare with the thermal emission
imaged with VLA on day 126; Fig. 2c). The differential velocity produces shocks (orange
lines), and at the edges of the optically thick ejecta the shocked material yields
compact radio knots (red blobs; compare with VLBI knots in Fig. 2). c, Once the white
dwarf wind ceases, the polar outflow will detach from the binary and quickly drop in
density as it expands (blue cones). The slower-expanding equatorial material will remain
dense for longer (yellow regions), and will dominate the radio images at late times
(compare with day-615 VLA imaging; Fig. 2d).


not previously been securely identified in novae with main-sequence companions, owing to
a paucity of high-resolution radio imaging enabling components of differing surface
brightness to be clearly distinguished.

The expanding thermal ejecta were resolved with the VLA when it entered its
high-resolution A configuration. An image from 2012 October 23 (day 126; 43 mas
resolution) shows that the ejecta have expanded and assumed a clearly bipolar geometry
consistent with analyses of optical spectral line profiles (Fig. 2c). The apparent
geometry of the ejecta is conveniently simplified, because we view the orbital plane of
V959 Mon edge-on. Our imaging illustrates that the VLBI knots were not simple jet-like
protrusions from the thermal ejecta. First, there were three VLBI knots when only two
are expected from a bipolar jet structure. Second, the major axis of the thermal ejecta
(directed east-west) was not well aligned with the expansion of the VLBI knots, but was
offset by 45°. In addition, the thermal ejecta expanded faster than the VLBI knots
(0.64 mas d⁻¹ in diameter; Extended Data Fig. 3). Finally, because the warm thermal
ejecta were optically thick at the time of this imaging, the VLBI knots were
superimposed around the edges of the ejecta, appearing to surround the two thermal lobes.

The origin of the compact radio knots was clarified when we revisited 
V959 Mon sixteen months later, when the VLA was next in its A con- 
figuration (2014 February 24; day 615; Fig. 2d). The much-expanded 
thermal ejecta maintained a bi-lobed morphology, but the axis of elon- 
gation had rotated so that the brightest regions were oriented north- 
south, perpendicular to the outflow observed in 2012. The position angle 
of the VLBI knots lay roughly halfway between that of the early and late 
axes of ejecta expansion (Fig. 2). 

This apparent rotation of the thermal ejecta between day 126 and day 615 was due to the
outflow being faster along the east-west axis, with the result that the east-west lobes
became optically thin first. Just such an asymmetry is predicted by hydrodynamic
simulations of interacting winds shaped by orbital evolution. In this scenario, binary
stars orbiting within the nova envelope transfer some of their orbital energy to the
surrounding material through viscous interaction, thereby expelling the envelope
preferentially along the orbital plane (Fig. 3a), corresponding to a north-south
orientation in V959 Mon. This equatorial material is observable as thermal ejecta, but
it expands relatively slowly, and its compact structure therefore proves difficult to
image at early times. Meanwhile, a fast, prolonged wind is blown off the white dwarf,
and this thermal wind preferentially expands in the low-density polar directions
(Fig. 3b). At early times, while the ejecta are optically thick, this fast material
expanding along the poles will dominate the radio images, as in Fig. 2c. Later, when the
thermal radio emission becomes optically thin, the dense material in the orbital plane
will be brightest (Figs 2d and 3c). A similar 90° flip of the major axis has been hinted
at in radio imaging of other novae, suggesting that such a transformation may be common
in classical novae.

The VLBI knots, and, by extension, the γ-ray emission, appear to be produced in the
interaction between the rapidly expanding material driven along the poles and the slower
equatorial material (Fig. 3c). This interaction within the ejecta could explain the
prolonged duration of the radio synchrotron emission, which lasts as long as the fast
wind flows past the dense material concentrated in the orbital plane.

The mass ejection observed in V959 Mon is a version of the common-envelope phase that
occurs in all close binary stars, and is a critical step in the formation of diverse
phenomena like X-ray binaries, type Ia supernovae and stripped-envelope supernovae.
Despite its widespread significance, common-envelope evolution remains one of the most
poorly understood phases of binary evolution, with few observational tests and models
that often fail to expel the envelope at all. V959 Mon shows that classical novae can
serve as a test-bed for developing an understanding of common-envelope evolution, and
that common-envelope interaction has a role in the ejection of nova envelopes.

An extensive multi-wavelength observational campaign shows V959 Mon to be a typical
classical nova. Its expansion velocities, spectral line profiles, binary period, binary
companion and optical light curve fall well within expected ranges. Additionally, after
the few early epochs showing a flat radio spectrum, the radio light curve of V959 Mon
became consistent with thermal emission from expanding warm ejecta, implying 4 × 10⁻⁵
solar masses of ejecta (a typical value for a classical nova; Extended Data Fig. 4 and
Supplementary Information). The only unusual characteristic of V959 Mon is its
proximity; at a distance of < 2 kpc (ref. 11), it is several times closer than the
typical nova, which explodes in the Galactic bulge (~8 kpc distant). Therefore, γ-rays
could be a common feature of normal classical novae.

Since the outburst of V959 Mon, three additional classical novae have 
been identified with Fermi, further implying that V959 Mon is not 
unusual, and many novae produce γ-rays. The recent increase in the rate 
of Fermi detections of novae can probably be explained by a combination 
of deeper, targeted detection efforts and a lucky crop of nearby 
novae; with effort, the sample of γ-ray-detected novae will continue to 
grow. The mechanism we propose here for powering the γ-ray emission 
in V959 Mon (binary interaction shaping nova ejecta and powering 
strong internal shocks) works in most novae, implying that each of 
these garden-variety explosions accelerates particles to relativistic speeds. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 23 June; accepted 12 August 2014. 
Published online 8 October 2014. 


1. Gehrz, R. D., Truran, J. W., Williams, R. E. & Starrfield, S. Nucleosynthesis in classical 
novae and its contribution to the interstellar medium. Publ. Astron. Soc. Pacif. 110, 
3-26 (1998). 

2. Starrfield, S., Truran, J. W., Sparks, W. M. & Kutter, G. S. CNO abundances and 
hydrodynamic models of the nova outburst. Astrophys. J. 176, 169-176 (1972). 

3. Kato, M. & Hachisu, I. Optically thick winds in nova outbursts. Astrophys. J. 437, 
802-826 (1994). 

4. MacDonald, J. The effect of a binary companion on a nova outburst. Mon. Not. R. 
Astron. Soc. 191, 933-949 (1980). 

5. The Fermi-LAT collaboration. Fermi establishes classical novae as a distinct class of 
γ-ray sources. Science 345, 554-558 (2014). 

6. Soker, N. & Livio, M. Interacting winds and the shaping of planetary nebulae. 
Astrophys. J. 339, 268-278 (1989). 

7. Porter, J. M., O’Brien, T. J. & Bode, M. F. On the asphericity of nova remnants caused 
by rotating white dwarf envelopes. Mon. Not. R. Astron. Soc. 296, 943-948 (1998). 

8. Shankar, A., Livio, M. & Truran, J. W. The common envelope phase in classical 
novae: one-dimensional models. Astrophys. J. 374, 623-630 (1991). 

9. Dubus, G. Gamma-ray binaries and related systems. Astron. Astrophys. Rev. 21, 64 
(2013). 

10. Abdo, A. A. et al. Gamma-ray emission concurrent with the nova in the symbiotic 
binary V407 Cygni. Science 329, 817-821 (2010). 

11. Munari, U. et al. Photometric evolution, orbital modulation and progenitor of Nova 
Mon 2012. Mon. Not. R. Astron. Soc. 435, 771-781 (2013). 

12. Hoard, D. W. et al. Nova-like cataclysmic variables in the infrared. Astrophys. J. 786, 
68 (2014). 

13. Seaquist, E. R. & Bode, M. F. in Classical Novae (eds Bode, M. F. & Evans, A.) 
141-166 (Cambridge Univ. Press, 2008). 

14. Roy, N. et al. Radio studies of novae: a current status report and highlights of new 
results. Bull. Astron. Soc. India 40, 293-310 (2012). 

15. Nelson, T. et al. X-ray and UV observations of Nova Mon 2012. Astron. Telegram 
4321 (2012). 

16. Seaquist, E. R. et al. A detailed study of the remnant of nova GK Persei and its 
environs. Astrophys. J. 344, 805-825 (1989). 

17. O’Brien, T. J. et al. An asymmetric shock wave in the 2006 outburst of the recurrent 
nova RS Ophiuchi. Nature 442, 279-281 (2006). 


342 | NATURE | VOL 514 | 16 OCTOBER 2014 


18. Kantharia, N. G. et al. Rapid rise in the radio synchrotron emission from the 
recurrent nova system V745 Sco. Astron. Telegram 5962 (2014). 

19. Ribeiro, V. A. R. M., Munari, U. & Valisa, P. Optical morphology, inclination, and 
expansion velocity of the ejected shell of Nova Monocerotis 2012. Astrophys. J. 
768, 49 (2013). 

20. Shore, S. N. et al. The spectroscopic evolution of the γ-ray emitting classical nova 
Nova Mon 2012. I. Implications for the ONe subclass of classical novae. Astron. 
Astrophys. 553, A123 (2013). 

21. Page, K. L. et al. The 7.1 hr X-ray-ultraviolet-near-infrared period of the γ-ray 
classical Nova Monocerotis 2012. Astrophys. J. 768, L26 (2013). 

22. Livio, M., Shankar, A., Burkert, A. & Truran, J. W. The common envelope phase in the 
outbursts of classical novae. Astrophys. J. 356, 250-254 (1990). 

23. Lloyd, H. M., O’Brien, T. J. & Bode, M. F. Shaping of nova remnants by binary motion. 
Mon. Not. R. Astron. Soc. 284, 137-147 (1997). 

24. Taylor, A. R., Hjellming, R. M., Seaquist, E. R. & Gehrz, R. D. Radio images of the 
expanding ejecta of nova QU Vulpeculae 1984. Nature 335, 235-238 (1988). 

25. Eyres, S. P. S., Davis, R. J. & Bode, M. F. Nova Cygni 1992 (V1974 Cygni): MERLIN 
observations from 1992 to 1994. Mon. Not. R. Astron. Soc. 279, 249-256 (1996). 

26. Heywood, I., O’Brien, T. J., Eyres, S. P. S., Bode, M. F. & Davis, R. J. V723 Cas (Nova 
Cassiopeiae 1995): MERLIN observations from 1996 to 2001. Mon. Not. R. Astron. 
Soc. 362, 469-474 (2005). 

27. Passy, J. et al. Simulating the common envelope phase of a red giant using 
smoothed-particle hydrodynamics and uniform-grid codes. Astrophys. J. 744, 52 
(2012). 

28. Ivanova, N. et al. Common envelope evolution: where we stand and how we can 
move forward. Astron. Astrophys. Rev. 21, 59 (2013). 

29. Cheung, C. C., Jean, P. & Shore, S. N. Fermi-LAT γ-ray observations of Nova Centauri 
2013. Astron. Telegram 5649 (2013). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements The National Radio Astronomy Observatory (NRAO) is a facility of 
the US National Science Foundation (NSF) operated under cooperative agreement by 
Associated Universities, Inc. The EVN is a joint facility of European, Chinese, South 
African and other radio astronomy institutes funded by their respective national 
research councils. The EVN and e-VLBI research infrastructures were supported by the 
European Commission Seventh Framework Programme (FP/2007-2013) under grant 
agreements nos 283393 (RadioNet3) and RI-261525 (NEXPReS). e-MERLIN is 
operated by The University of Manchester at Jodrell Bank Observatory on behalf of the 
Science and Technology Facilities Council. The SMA is a joint project between the 
Smithsonian Astrophysical Observatory and the Academia Sinica Institute of 
Astronomy and Astrophysics. Support for CARMA construction came from the Moore 
Foundation, the Norris Foundation, the McDonnell Foundation, the Associates of the 
California Institute of Technology, the University of Chicago, the states of California, 
Illinois and Maryland, and the NSF. Ongoing CARMA development and operations are 
supported by the NSF and by the CARMA partner universities. L.C. is a Jansky Fellow of 
the NRAO. This research received funding from NASA programmes DPR S-15633-Y 
and 10-FERMI10-C4-0060 (C.C.C.), NASA award NNX13A091G (T.N.), NSF award 
AST-1211778 (J.L.S. and J.W.), the South African SKA Project (V.A.R.M.R.) and the 
Alexander von Humboldt Foundation (N.R.). 


Author Contributions L.C. wrote the text. L.C., J.D.L., J.Y., T.J.O., Z.P., A.J.M., C.C.C., R.J.B., 
T.N., Y.Z., J.W. and G.B.T. obtained and reduced the data. All authors contributed to the 
interpretation of the data and commented on the final manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to L.C. (chomiuk@pa.msu.edu). 


©2014 Macmillan Publishers Limited. All rights reserved 


METHODS 

VLA. We observed V959 Mon with the VLA from June 2012 until the time of 
publication as part of programmes S4322, 12B-375, 13A-461, 13B-057 and S61420. 
Monitoring was carried out in the L (1-2 GHz), C (4-8 GHz), Ku (12-18 GHz) and Ka 
(26-36 GHz) bands across all array configurations. In the C, Ku and Ka bands, 
we observed with two sidebands, each composed of eight spectral windows with 64 
2 MHz-wide channels and four polarization products. Each sideband had a 
bandwidth of 1 GHz, and the two sidebands were separated to enhance our frequency 
coverage. In the L band, we obtained only 1 GHz of bandwidth in total, covering 
the entire available frequency range in this band. 

In the first three epochs, the entire positional error circle of the Fermi transient 
was observed (95% confidence radius of 11 arcmin), with a single pointing in the 
L band and a seven-pointing mosaic in the C band. Later, after identification of the 
optical nova, a single pointing was centred on V959 Mon, typically yielding 
~10-15 min on source in each band and epoch. 

We observed J0632+1022 as the secondary phase calibrator in the L and C bands, 
and J0643+0857 in the Ku and Ka bands. 3C147 was used as an absolute flux 
density and band-pass calibrator. Data were reduced using standard routines in CASA 
and AIPS, and in most cases a single round of phase-only self-calibration was 
implemented with 1 min solution intervals. Imaging was carried out in CASA, AIPS 
and Difmap, typically using a Briggs robust value of 1. Flux densities were 
obtained by fitting a Gaussian to the source, using JMFIT in AIPS or gaussfit in CASA. 
Measurements are presented in Extended Data Table 1. Uncertainties include a 
5% calibration error in the L and C bands and a 10% calibration error in the Ku and 
Ka bands. 
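
As an illustration of how a fractional calibration term enters the quoted uncertainties, the sketch below assumes that the calibration and statistical terms were combined in quadrature; the text does not state the combination rule, so this is an assumption. It uses two entries from Extended Data Table 1 (5% in the L band, and a Ku-band entry whose quoted uncertainty is consistent with a 10% term). The function name is hypothetical.

```python
import math

def implied_statistical_error(flux, total_error, cal_fraction):
    """Statistical term left after removing the calibration term in quadrature.

    Assumes total_error**2 = stat**2 + (cal_fraction * flux)**2, which is an
    assumption of this sketch rather than a rule stated in the text.
    """
    cal_error = cal_fraction * flux
    # Rounded table values can make the difference slightly negative; clamp at zero.
    return math.sqrt(max(total_error**2 - cal_error**2, 0.0))

# (frequency in GHz, flux density in mJy, quoted uncertainty in mJy, calibration fraction)
rows = [
    (1.3, 2.55, 0.14, 0.05),    # L band, 2012 Sep 27.4
    (13.5, 32.46, 3.25, 0.10),  # Ku band, 2012 Sep 15.4
]
for nu, flux, err, frac in rows:
    print(nu, round(implied_statistical_error(flux, err, frac), 3))
```

Under this assumption the calibration term dominates the Ku-band uncertainty, whereas the L-band entry retains a noticeable statistical contribution.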

In the A configuration epochs, which provide the highest angular resolution, 
special attention was devoted to imaging. To achieve the highest possible resolution, 
the VLA images shown in Fig. 2 were produced in Difmap with uniform weighting. 
The 36.5 GHz image from 2012 October 23 featured in Fig. 2c has a synthesized beam 
of 44 mas × 42 mas full-width at half-maximum (FWHM) and a root mean squared 
(r.m.s.) sensitivity of 73 μJy per beam. The 17.5 GHz image from 2014 February 24 in 
Fig. 2d has a 0.125″ × 0.09″ FWHM synthesized beam and r.m.s. noise of 19 μJy per 
beam. When smoothed to similar resolution, images at other frequencies show 
similar structure on this date. We note that during the first A configuration (October 
2012-January 2013), the nova was not spatially resolved at the lower frequencies 
(L and C bands). In the second A configuration (February 2014), V959 Mon was 
slightly resolved in the C band and remained unresolved in the L band. 
Millimetre data. The 225 GHz flux density of V959 Mon was monitored using the 
Submillimetre Array (SMA) and Combined Array for Research in Millimeter-wave 
Astronomy (CARMA). Under SMA programme 2012A-S016, observations were 
obtained from September to December 2012. CARMA observations were obtained 
at 96 and 230 GHz during May and August 2013 under programme c1130. Data 
were reduced using standard routines in MIRIAD, and measurements are listed 
in Extended Data Table 2. Uncertainties include calibration errors of at least 10%. 

In addition, we used high-frequency measurements obtained with the Institute 
for Radio Astronomy in the Millimeter Range (IRAM) 30 m telescope and the Plateau 
de Bure Interferometer (PdBI) (Extended Data Table 2). These measurements 
additionally constrained our radio/millimetre spectrum fitting and light curve mod- 
elling, as shown in Extended Data Figs 1 and 4. 
e-MERLIN. The e-MERLIN array was used to observe V959 Mon on a number of 
occasions, starting in September 2012, when the VLA detected an increase in radio 
brightness. The first epoch (Fig. 2b) combined observations from 2012 September 
13 and 14. The observations used six telescopes of the e-MERLIN array. They were 
made in the C band, centred at 5.75 GHz with a bandwidth of 512 MHz per 
polarization. This total bandwidth was split into four adjacent, 128 MHz-wide sub-bands 
and correlated with 512 channels per sub-band. We observed J0645+0541 as a phase 
calibrator, 3C286 as a flux calibrator and OQ208 as a band-pass calibrator. Data were 
calibrated and reduced with standard tasks in AIPS. The data were phase 
self-calibrated, and imaging was carried out in AIPS using a Briggs robust parameter of 1. 
The synthesized beam used to restore the image is 65 mas × 45 mas. The peak flux 
density in this map is 3.7 mJy per beam, with r.m.s. noise of 53 μJy per beam. The 
total flux density is estimated as 7.0 ± 0.5 mJy (obtained via Gaussian fitting to the 
image data), which is consistent with estimates made from VLA observations on 
shorter baselines around the same time. 

EVN. We performed the EVN observations of V959 Mon in the C band (5.0 GHz) 
in five epochs spanning September 2012 to January 2013. Observations were carried 
out with a data rate of 1,024 Mbit s⁻¹ and 2-bit sampling, yielding dual polarization 
and a 128 MHz bandwidth. Observation duration was ~7 h per epoch. The first 
three observations were EVN Target of Opportunity experiments during the e-EVN 
sessions (project codes RO005 and RO006), and the last two epochs were part of 
project EO011. Participating EVN stations were Effelsberg, the phased array of the 
Westerbork Synthesis Radio Telescope, Onsala, Medicina, Noto, Torun, and the 
Lovell telescope; Effelsberg was not functioning during the last epoch owing to 
snow. Through the wideband internet connection to the EVN stations, all 
observations were correlated in real time at the Joint Institute for VLBI in Europe with 
the following default correlation parameters: 2 s integration time and 32 frequency 
points per sub-band. 
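
The relation between the recording rate and the bandwidth quoted above can be checked with a short calculation. The sketch assumes Nyquist sampling (two samples per second per hertz of bandwidth), which is the usual convention but is not stated explicitly in the text; the function name is hypothetical.

```python
def bandwidth_per_polarization_mhz(bit_rate_mbit_s, bits_per_sample, n_polarizations):
    """Bandwidth per polarization implied by a recording rate.

    Assumes Nyquist sampling: 2 samples per second for each Hz of bandwidth.
    """
    total_bandwidth_mhz = bit_rate_mbit_s / (2 * bits_per_sample)
    return total_bandwidth_mhz / n_polarizations

# 1,024 Mbit/s, 2-bit sampling, dual polarization
print(bandwidth_per_polarization_mhz(1024, 2, 2))  # prints 128.0
```

With the stated data rate and sampling, the result matches the 128 MHz bandwidth given for the EVN observations.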

We observed J0645+0541 as the phase-referencing source in all our epochs. The 
J2000 coordinates were RA = 06 h 45 min 47.27653 s and dec. = 05° 41′ 22.3857″ 
(absolute 1σ positional uncertainty: 1.1 mas in RA and 2.0 mas in dec.). The 
separation between the nodding calibrator and V959 Mon was 1.54°. We also observed 
a secondary calibrator as an astrometric check source, which implied a 1σ 
systematic position error of 0.43 mas in RA and 0.70 mas in dec. We used a nodding cycle 
of 1 min on the calibrator and 3 min on the target. 
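
The quoted calibrator-target separation can be reproduced from the coordinates. The sketch below uses the calibrator position given above and, as an assumption, takes the target position to be that of knot A on 2012 September 18 from Extended Data Table 3 (06h39m38.60057s, +05°53′52.8267″). Function names are hypothetical.

```python
import math

def hms_to_deg(h, m, s):
    """Right ascension in hours, minutes, seconds to degrees."""
    return 15.0 * (h + m / 60.0 + s / 3600.0)

def dms_to_deg(d, m, s):
    """Declination in degrees, arcmin, arcsec to degrees (positive values only)."""
    return d + m / 60.0 + s / 3600.0

def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation via the haversine formula; inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    a = (math.sin((dec2 - dec1) / 2.0) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2.0) ** 2)
    return math.degrees(2.0 * math.asin(math.sqrt(a)))

calibrator = (hms_to_deg(6, 45, 47.27653), dms_to_deg(5, 41, 22.3857))
# Target position assumed from Extended Data Table 3 (knot A, 2012 Sep 18)
target = (hms_to_deg(6, 39, 38.60057), dms_to_deg(5, 53, 52.8267))

separation = angular_separation_deg(*calibrator, *target)
print(f"{separation:.2f} deg")
```

The result agrees with the 1.54° separation stated in the text to within rounding.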

Initial data calibration was carried out in AIPS using standard routines. After 
this calibration, the data were averaged in each sub-band. Imaging and 
deconvolution were then carried out in Difmap. To remove the phase error associated with 
the source structure, we first imaged the reference source J0645+0541 and then 
redid fringe-fitting with the calibrator image. The source J0645+0541 showed a 
single-sided core-jet structure with a total flux density of 175 ± 18 mJy. The centroid of the 
radio core was determined by Gaussian model fitting and later used as the image 
reference origin. We imaged V959 Mon with natural weighting in Difmap. 

The intensity image in the first epoch (2012 September 18) had a synthesized beam 
with FWHM 9.2 mas × 5.5 mas at a position angle of −38.0° and an r.m.s. sensitivity 
of 0.03 mJy per beam. A pair of knots was clearly detected (Extended Data Table 3). 
Here we name the brighter (eastern) one A and the weaker (western) one B; this 
image is shown as white contours in the top panel of Fig. 2. Knots A and B had an 
angular separation of 35.5 ± 0.2 mas. After the knots had been fitted with 
point-source models, there still existed some extended emission with a peak brightness of 
0.27 mJy per beam in the residual map. We modelled the residual extended emission 
with two circular Gaussian components (respective FWHMs of 16.4 and 29.9 mas). 
Each knot was associated with one extended emission region. 

We detected the pair of radio knots again in the second epoch (2012 October 10), 
along with a new, third knot (Fig. 2a). Knots A and B had faded since the first epoch 
(Extended Data Table 3). Knot A was resolved and its intensity distribution was 
well fitted by two point sources (A1 and A2) with a separation of 5.7 ± 0.2 mas at 
position angle −179.5°. The separation was 45.2 ± 0.4 mas at position angle 131.9° 
between A1 and B, and was 49.1 ± 0.4 mas at position angle 137.0° between A2 and 
B. The new, third knot, dubbed C, was located east of A, and was brighter than 
knot B in this epoch. After fitting and removing the knots, there was some residual 
large-scale structure showing as regular stripes, most probably a hint of the extended 
emission seen with the VLA. The synthesized beam had a FWHM of 8.4 mas × 6.0 mas 
at a position angle of −44.3°. 

We find that radio knots A and B both appeared to be moving away from a 
central position. Between 2012 September 18 and 2012 October 10, they were 
measured to recede from one another at 0.50 mas d⁻¹. 

We failed to detect any compact features associated with V959 Mon in three EVN 
epochs (2012 November 14, 2012 December 4 and 2013 January 15), because of its 
low peak brightness and significant expansion. We place 5σ upper limits on the 
peak flux density from compact knots of <0.25 mJy per beam on 2012 November 
14, <0.15 mJy per beam on 2012 December 4 and <0.18 mJy per beam on 2013 
January 15. There was a hint of extended emission in the raw image, but it was not 
possible to locate the position, owing to strong side lobes of the synthesized beam. 
The source was only weakly detected on the shortest and most sensitive 
baseline (Effelsberg-Westerbork) in the third and fourth epochs. V959 Mon was 
obscured by noise in the last EVN epoch. 

Positions and fluxes of the EVN components are listed in Extended Data Table 3. 
Very Long Baseline Array. We observed V959 Mon with the Very Long Baseline 
Array (VLBA) on 2012 October 3, October 14, October 30 and November 17 under 
the NRAO project code BM0385. For the first three epochs, we observed in both 
the L and the C band, and the final epoch was in the C band. Each observation had 
eight spectral windows covering a total bandwidth of 256 MHz via the new ROACH 
digital backend (RDBE) and yielded a total on-source time of ~100 min per fre- 
quency band. For the first epoch, each spectral window had eight channels with 
4 MHz of bandwidth per channel. For the subsequent observations, we had 64 chan- 
nels per spectral window with 500 kHz per channel. The first three epochs had cen- 
tre frequencies of 1.55 and 4.98 GHz for the L and C bands, respectively. The final 
C-band epoch had a centre frequency of 4.24 GHz. The C-band phase reference 
source was J0650+0358 for the first two epochs and J0645+0541 for the final two 
epochs (changed to match the calibrator used for EVN observations). The L-band 
phase reference source was J0650+0358 for all epochs. The data were reduced using 
standard routines in AIPS, and images were made using both AIPS and Difmap. 

In our first VLBA epoch, we detected VLBI knots A and B in both the 5.0 GHz 
data and 1.6 GHz data. For the 5.0 GHz image (3.5 mas × 1.5 mas synthesized beam; 
r.m.s. sensitivity, 27 μJy per beam), knot A was not well fitted with a point source 
and was therefore modelled with a circular Gaussian, whereas knot B was well fitted 
by a point-source model (Extended Data Table 3). We also noticed a possible third 
radio knot approximately 7.7 mas north of the brightest component. For the 1.6 GHz 
image (11 mas × 5 mas synthesized beam; r.m.s. sensitivity, 49 μJy per beam), both 
component A and component B were diffuse and modelled with circular Gaussians. 
There was no sign of the third component in the 1.6 GHz image. 

For the second VLBA epoch, we detected the three EVN knots in the 5.0 GHz 
image (3.8 mas × 1.7 mas synthesized beam; r.m.s. sensitivity, 37 μJy per beam). 
Knots A and B had faded with respect to the first VLBA observation, and knot C 
was seen to the east of knot A. Knot A was best fitted by two point-source models 
and a circular Gaussian component to account for diffuse emission. Knots B and C 
were well fitted by single point sources. In the 1.6 GHz data (12 mas × 6 mas 
synthesized beam; r.m.s. sensitivity, 27 μJy per beam), only knot B was obvious. It was 
again fitted with a circular Gaussian. There was a second small knot in the vicinity 
of knot A, and this was also fitted with a circular Gaussian, but we do not believe it 
accounts for the total flux in knot A. 

Unfortunately, the phase reference source did not produce strong fringes on the 
longer baselines at 5.0 GHz for the third and fourth VLBA epochs. In addition, one 
antenna was inoperative during each of these observations. The reduced sensitivity 
and lower resolution led to problems identifying distinct components, and the change 
in phase reference source made it difficult to compare positions between the first 
and last two epochs. For the third epoch at 5.0 GHz, knot A had a flux density of 
1.0 ± 0.1 mJy, whereas knot B was not detected with high confidence (upper limit 
of 0.14 mJy). At 1.6 GHz, only one component in the vicinity of knot A was detected 
unambiguously, but with a dramatically increased flux density of 1.2 ± 0.1 mJy. 
Knot B had an upper limit of 0.14 mJy. The increase in flux density for knot A and 
the drop in flux density for knot B could indicate that the shock associated with knot A 
encountered a region of higher density leading to increased synchrotron emission, 
and that knot B exhausted its supply of relativistic particles. For the fourth epoch, 
no 1.6 GHz data were obtained. The source had dimmed significantly at 5.0 GHz, 
and knot A was just barely detected, with a flux density of 0.16 ± 0.05 mJy. Knot B 
was not detected (upper limit of 0.11 mJy). 

Positions and fluxes of the VLBA components are listed in Extended Data Table 3. 
We note that some small disagreement between EVN and VLBA positions is ex- 
pected owing to the use of different phase reference sources. 

Concurrent observations at 1.6 and 5.0 GHz enable the creation of a spectral index 
map. For the 2012 October 3 observations, we applied a (u, v)-taper to the 5.0 GHz 
data to match resolution with our 1.6 GHz image. The matched-resolution images 
were then aligned via two-dimensional cross correlation, and low signal-to-noise 
regions were blanked. We used the AIPS task SPIXR to produce the map presented 
in Extended Data Fig. 2. Knot A had an overall positive spectral index, with a mean 
α of ~1.2 in the region of high signal-to-noise ratio. Knot B had an overall negative 
spectral index, with a mean value of α ≈ −0.3 in the regions of highest signal-to-noise 
ratio. The spectra were generally flatter (α closer to 0) near the edges of the knots. 

The spectral index of knot B appears roughly constant between days 106 and 117 
(Extended Data Table 3). Curiously, knot A appeared to have faded at low frequency 
on day 117 (leading to α > 1.2), but then brightened drastically at 1.6 GHz on day 133 
(yielding α ≈ 0). The optical depth of knot A therefore appears to be time variable. 
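
The trends described here can be illustrated with two-point spectral indices computed from the integrated component flux densities in Extended Data Table 3. This is a minimal sketch that assumes the convention S ∝ ν^α and nominal frequencies of 1.6 and 5.0 GHz; integrated values need not equal the map means quoted above, which are restricted to regions of high signal-to-noise ratio. The function and variable names are hypothetical.

```python
import math

def spectral_index(s_low, nu_low, s_high, nu_high):
    """Two-point spectral index alpha, assuming S is proportional to nu**alpha."""
    return math.log(s_high / s_low) / math.log(nu_high / nu_low)

NU_LOW, NU_HIGH = 1.6, 5.0  # GHz, nominal VLBA band centres

# (flux density at 1.6 GHz, flux density at 5.0 GHz) in mJy, keyed by day since t0
knot_a = {106: (0.53, 1.66), 117: (0.17, 1.31), 133: (1.20, 1.03)}
knot_b = {106: (0.59, 0.16), 117: (0.52, 0.16)}

alpha_a = {day: spectral_index(lo, NU_LOW, hi, NU_HIGH) for day, (lo, hi) in knot_a.items()}
alpha_b = {day: spectral_index(lo, NU_LOW, hi, NU_HIGH) for day, (lo, hi) in knot_b.items()}

for day in sorted(alpha_a):
    print("knot A, day", day, round(alpha_a[day], 2))
for day in sorted(alpha_b):
    print("knot B, day", day, round(alpha_b[day], 2))
```

Under these assumptions, knot A steepens to α > 1.2 on day 117 and flattens to near zero on day 133, while knot B changes little between days 106 and 117.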

From our two epochs of 5 GHz data, we can get an independent estimate of the 
expansion rate for knots A and B. As stated previously, the EVN observations 
exhibited an expansion rate of ~0.50 mas d⁻¹ between 2012 September 18 and 
October 10. Using the VLBA positions listed in Extended Data Table 3, we find an 
expansion rate of ~0.35 mas d⁻¹ between 2012 October 3 and October 14. Because 
the VLBA observations were made later than the EVN observations, the lower rate 
may indicate that the expansion was slowing down. However, possible evolution of 
the source structure and the different (u, v) coverages of the instruments also 
introduce significant uncertainties into this comparison. Because we have only two epochs 
with detections in each array, it proves difficult to establish the detailed evolution 
of the VLBI knot expansion velocity. 
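
The two expansion rates can be approximately recovered from the 5.0 GHz positions of knots A and B in Extended Data Table 3. The sketch below uses a small-angle, flat-sky approximation with a single cos(dec.) factor, which is an assumption of this example and not necessarily the procedure used for the quoted values; names are hypothetical.

```python
import math

DEC_DEG = 5.0 + 53.0 / 60.0  # approximate declination of V959 Mon

def separation_mas(ra_sec_a, dec_arcsec_a, ra_sec_b, dec_arcsec_b):
    """Separation of two nearby positions given as seconds of RA and arcsec of dec."""
    # 1 s of RA = 15 arcsec on the equator; scale by cos(dec) for the true angle
    d_ra = (ra_sec_a - ra_sec_b) * 15.0 * 1000.0 * math.cos(math.radians(DEC_DEG))
    d_dec = (dec_arcsec_a - dec_arcsec_b) * 1000.0
    return math.hypot(d_ra, d_dec)

def separation_rate(first, second):
    """Rate in mas per day; each argument is (day, knot A RA/dec, knot B RA/dec)."""
    (day1, a1, b1), (day2, a2, b2) = first, second
    return (separation_mas(*a2, *b2) - separation_mas(*a1, *b1)) / (day2 - day1)

# Positions are the XX fields of 06h39mXXs and 05d53'XX'' in Extended Data Table 3
evn_rate = separation_rate(
    (91, (38.60057, 52.8267), (38.59886, 52.8514)),
    (113, (38.60077, 52.8225), (38.59852, 52.8555)),
)
vlba_rate = separation_rate(
    (106, (38.60087, 52.8315), (38.59881, 52.8597)),
    (117, (38.60097, 52.8327), (38.59867, 52.8626)),
)
print(f"EVN: {evn_rate:.2f} mas/d, VLBA: {vlba_rate:.2f} mas/d")
```

The first-epoch EVN separation computed this way is close to the 35.5 mas quoted earlier, and the two rates come out near 0.5 and 0.35 mas d⁻¹.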

Expansion of the thermal ejecta: days 127-199. During the first period of VLA 
A configuration, we obtained five epochs spanning 2012 October 23-2013 January 
4 (Extended Data Table 1). V959 Mon was clearly resolved at frequencies of 
13.5-36.5 GHz. V959 Mon was bright during this period, and imaged at significance 
levels of >100σ per synthesized beam. In each image, V959 Mon was fitted with a 
Gaussian using the task JMFIT in AIPS. The width and position angle of the Gaussian 
were allowed to vary, and the angular dimensions of V959 Mon were found by 
deconvolving the synthesized beam from the fitted Gaussian. 

Gaussian fits provide simple first-order estimates of the changing dimensions 
of V959 Mon, although the source is not perfectly described with this profile form 
(future work will involve a more detailed analysis of the source geometry). The 
position angles of the fitted Gaussians were constant in time, consistent across frequency 
(~87°) and distinct from the position angle of the synthesized beams. V959 Mon 
clearly expanded along both its major (east-west) and minor (north-south) axes 
over the three months of A-configuration observations. Extended Data Fig. 3 shows 
the deconvolved radii of V959 Mon over this period, with the semi-major axis plotted 
in the top panel and the semi-minor axis plotted in the bottom panel. Both axes 
were observed to increase with remarkable linearity at all four frequencies; linear 
fits are overplotted. At 13.5 GHz, we find diameter expansion rates of 0.62 mas d⁻¹ 
and 0.64 mas d⁻¹ along the minor and major axes, respectively. At 36.5 GHz, we find 
diameter expansion rates of 0.66 mas d⁻¹ along the minor axis and 0.37 mas d⁻¹ 
for the major axis. 
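
To give a sense of scale, an angular expansion rate can be converted to a projected velocity once a distance is adopted. The sketch below assumes a distance of 1.5 kpc purely for illustration (the text only constrains the distance to <2 kpc) and halves the diameter rates to obtain radial rates; projection effects are ignored. Names are hypothetical.

```python
AU_KM = 1.495978707e8    # kilometres per astronomical unit
SECONDS_PER_DAY = 86400.0

def projected_velocity_km_s(rate_mas_per_day, distance_kpc):
    """Plane-of-sky velocity for an angular rate at a given distance.

    Uses the small-angle identity that 1 mas at 1 kpc subtends 1 au.
    """
    au_per_day = rate_mas_per_day * distance_kpc
    return au_per_day * AU_KM / SECONDS_PER_DAY

ASSUMED_DISTANCE_KPC = 1.5  # illustrative assumption; the text gives only < 2 kpc

# Diameter expansion rates at 13.5 GHz (mas per day), halved to give radial rates
for axis, diameter_rate in (("minor", 0.62), ("major", 0.64)):
    velocity = projected_velocity_km_s(diameter_rate / 2.0, ASSUMED_DISTANCE_KPC)
    print(axis, round(velocity), "km/s")
```

Because the velocity scales linearly with the adopted distance, a different distance simply rescales these values.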

At any given epoch, very similar minor-axis radii were measured at all four fre- 
quencies. However, the size of the major axis varied with frequency, with the ex- 
ception of the first epoch. Measurements at 36.5 GHz of the major axis yielded 
systematically smaller radii than did 13.5 GHz measurements (and data points at the 
intermediate frequencies of 17.5 and 28.2 GHz also follow this trend). Assessing all 
five epochs, it is clear that the major axis of V959 Mon grew most slowly at highest 
frequency. 

The material expanding along V959 Mon’s minor axis was consistent with being 
optically thick throughout the A-configuration observations. When optically thick, 
all frequencies display roughly the same radius of the τ ≈ 1 surface, as we observed 
for material expanding along the minor axis of V959 Mon. However, the frequency 
dependence of V959 Mon’s major axis can be explained if the ejecta expanding along 
the major axis were becoming optically thin over the course of the imaging campaign. 
Extended Data Figure 1 | Radio/millimetre spectral evolution of V959 Mon. 
Measurements and 1σ uncertainties from select epochs are shown as black 
points. Power-law or broken power-law fits are overplotted as red lines (the 
function is chosen to minimize the reduced χ² value). The best-fit spectral 
indices are listed in each panel, along with the break frequency (νb) in the case of 
broken power-law fits. 
Extended Data Figure 2 | Spectral index map from 2012 October 3 VLBA observations. The spectral index is measured by comparing images at 1.6 and 5 GHz. 
Overlaid contours are from the 1.6 GHz Stokes I map. Contour levels are —0.08, 0.08, 0.13, 0.16, 0.23, 0.32 and 0.45 mJy per beam. 
Extended Data Figure 3 | The expansion of V959 Mon as a function of time. 
Semi-major axis (a) and semi-minor axis (b), both in units of milliarcseconds. 
Measurements at four distinct frequencies are plotted in different colours 
(see key). Error bars from JMFIT (1σ) are so small that they are not visible. 
Linear fits are made to each frequency separately, and are plotted as 
coloured lines. 
Extended Data Figure 4 | Model fit to the radio/millimetre light curve of 
V959 Mon. A simple model of thermal expanding ejecta roughly describes the 
light curve evolution at day ~200 and later, and implies an ejected mass of a few 
× 10⁻⁵ solar masses. Error bars denote 1σ uncertainty. 
Extended Data Table 1 | VLA observations of V959 Mon 

Date (UT)       t − t0 (days)   Conf.    ν (GHz)   Sν (mJy) 
2012 Jun 30.7    11.7           B         1.3      0.10 ± 0.18 
2012 Jul 4.7     15.7           B         1.3      0.11 ± 0.20 
2012 Jul 28.6    39.6           B         1.4      0.54 ± 0.14 
2012 Sep 1.4     74.4           A         1.3      0.39 ± 0.07 
                                         13.3      20.20 ± 2.02 
2012 Sep 8.5     81.5           BnA       1.3      1.24 ± 0.10 
2012 Sep 15.4    88.4           BnA      13.5      32.46 ± 3.25 
                                         40.6      211.7 ± 21.2 
2012 Sep 22.5    95.5           BnA      13.5      32.44 ± 3.26 
2012 Sep 27.4   100.4           BnA→A     1.3      2.55 ± 0.14 
2012 Oct 14.7   117.7           A         1.3      2.75 ± 0.16 
2012 Oct 23.4   126.4           A        13.6      71.03 ± 7.10 
2012 Nov 10.3   144.3           A         1.3      5.06 ± 0.38 
2012 Nov 12.3   146.3           A        13.5      75.70 ± 7.60 
2012 Nov 23.4   157.4           A         1.3      7.61 ± 0.39 
                                         13.6      70.15 ± 7.06 
2012 Dec 7.2    171.2           A        13.5      65.08 ± 6.51 
2012 Dec 8.5    172.5           A         1.3      8.40 ± 0.43 
2013 Jan 3.1    198.1           A         1.3      8.16 ± 0.41 
2013 Jan 4.3    199.3           A        13.5      62.45 ± 6.25 
2013 Feb 20.1   246.1           D         1.4      6.22 ± 0.35 
2013 Mar 05.1   259.1           D        13.6      43.87 ± 4.39 
2013 Apr 17.0   302.0           D         1.4      2.93 ± 0.36 
2013 Apr 28.9   313.9           D        13.2      30.12 ± 3.01 
2013 Jun 1.4    347.4           DnC→C     1.4      2.69 ± 0.37 
                                         13.2      24.86 ± 2.49 
2013 Jul 27.6   403.6           C         1.4      1.95 ± 0.19 
                                         13.5      16.86 ± 1.69 
2013 Aug 22.5   429.5           C         1.4      1.81 ± 0.23 
                                         13.5      14.13 ± 1.41 
2013 Oct 04.4   472.4           B         1.4      1.96 ± 0.14 
                                         13.6      10.71 ± 1.07 
2013 Dec 13.2   542.2           B         1.4      1.88 ± 0.17 
                                         13.6      6.16 ± 0.62 
2014 Feb 24.2   615.2           A        13.6      4.67 ± 0.48 
2014 Feb 25.0   616.0           A         1.4      1.82 ± 0.11 
4.74 ± 0.24 
16.28 ± 0.82 
138.9 ± 13.9 
21.45 ± 1.36 
116.9 ± 11.9 


25.94 ± 1.30 
135.3 ± 14.0 


133.5 ± 13.4 
23.76 ± 1.19 
20.17 ± 1.01 
134.9 ± 13.5 
74.11 ± 7.41 
39.48 ± 3.95 


10.53 ± 0.53 
109.3 ± 10.9 


9.16 ± 0.46 


136.6 ± 13.7 


124.9 ± 12.6 
17.21 ± 0.88 
30.99 ± 1.57 
173.9 ± 17.4 
34.73 ± 1.78 
138.6 ± 14.0 


36.60 ± 1.83 
156.9 ± 16.1 


163.0 ± 16.3 
33.51 ± 1.68 
30.45 ± 1.52 
163.4 ± 16.4 
25.83 ± 1.29 
83.35 ± 8.34 
21.73 ± 1.09 
38.69 ± 3.87 


16.58 ± 0.83 
27.87 ± 2.79 


13.30 ± 0.67 
15.87 ± 1.59 
Flux density measurements at 1-46 GHz, spanning June 2012-February 2014. The time t0 corresponds to γ-ray discovery, 2012 June 19. 
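
The t − t0 column of the Extended Data tables can be reproduced from the observation dates. The following sketch assumes that t0 is 2012 June 19.0 UT and that dates are given as a calendar day with a decimal fraction; the function name is hypothetical.

```python
from datetime import date

T0 = date(2012, 6, 19)  # gamma-ray discovery date, taken as day 0.0

def days_since_discovery(year, month, fractional_day):
    """Days elapsed since T0 for a date such as 2012 Sep 27.4."""
    whole_day = int(fractional_day)
    fraction = fractional_day - whole_day
    return (date(year, month, whole_day) - T0).days + fraction

for label, args in (("2012 Sep 27.4", (2012, 9, 27.4)),
                    ("2013 Jul 27.6", (2013, 7, 27.6)),
                    ("2014 Feb 24.2", (2014, 2, 24.2))):
    print(label, round(days_since_discovery(*args), 1))
```

A check of this kind is a convenient way to catch transcription errors in the date or elapsed-time columns.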
Extended Data Table 2 | Millimetre observations of V959 Mon 

Date (UT)      t − t0 (days)   Facility   ν (GHz)   Sν (mJy) 
2012 Sep 6      79             IRAM        86       453 ± 121 
2012 Sep 6      79             SMA        225       500 ± 125 
2012 Sep 8      81             PdBI        87       330 ± 33 
2012 Sep 22     95             SMA        225       570 ± 57 
2012 Oct 10    113             SMA        225       284 ± 28 
2012 Dec 5     169             SMA        220       287 ± 28 
2013 May 21    336             CARMA       96       34.7 ± 3.6 
2013 Aug 11    418             CARMA       96       13.8 ± 1.5 
2013 Aug 14    421             CARMA      230       10.2 ± 2.0 

Flux density measurements obtained with the SMA, CARMA, IRAM and PdBI telescopes at 86-230 GHz. 
The time t0 corresponds to γ-ray discovery, 2012 June 19. 
Extended Data Table 3 | VLBI components of V959 Mon 

Date (UT)     t − t0 (days)  Facility  ν (GHz)  Sν (mJy)      RA (06h39mXXs)  Dec. (05d53′XX″)  Phase ref. source 
Component A 
2012 Sep 18    91            EVN       5.0      1.04 ± 0.11   38.60057        52.8267           J0645+0541 
2012 Oct 03   106            VLBA      1.6      0.53 ± 0.06   38.60077        52.8311           J0650+0358 
2012 Oct 03   106            VLBA      5.0      1.66 ± 0.17   38.60087        52.8315           J0650+0358 
2012 Oct 10   113            EVN       5.0      1.45 ± 0.15   38.60077        52.8225           J0645+0541 
2012 Oct 14   117            VLBA      1.6      0.17 ± 0.05   38.60054        52.7915           J0650+0358 
2012 Oct 14   117            VLBA      5.0      1.31 ± 0.14   38.60097        52.8327           J0650+0358 
2012 Oct 30   133            VLBA      1.6      1.20 ± 0.13   38.60092        52.82778          J0645+0541 
2012 Oct 30   133            VLBA      5.0      1.03 ± 0.12   38.60098        52.82325          J0650+0358 
Component B 
2012 Sep 18    91            EVN       5.0      0.42 ± 0.05   38.59886        52.8514           J0645+0541 
2012 Oct 03   106            VLBA      1.6      0.59 ± 0.07   38.59883        52.8668           J0650+0358 
2012 Oct 03   106            VLBA      5.0      0.16 ± 0.03   38.59881        52.8597           J0650+0358 
2012 Oct 10   113            EVN       5.0      0.31 ± 0.04   38.59852        52.8555           J0645+0541 
2012 Oct 14   117            VLBA      1.6      0.52 ± 0.07   38.59865        52.8620           J0650+0358 
2012 Oct 14   117            VLBA      5.0      0.16 ± 0.04   38.59867        52.8626           J0650+0358 
Component C 
2012 Oct 10   113            EVN       5.0      0.36 ± 0.05   38.60251        52.8518           J0645+0541 
2012 Oct 14   117            VLBA      5.0      0.14 ± 0.04   38.60267        52.8622           J0650+0358 

Positions and flux densities of VLBI knots, observed with EVN and VLBA. The time t0 corresponds to γ-ray discovery, 2012 June 19. 

Giant Rydberg excitons in the copper oxide Cu₂O


T. Kazimierczuk, D. Fröhlich, S. Scheel, H. Stolz & M. Bayer


A highly excited atom having an electron that has moved into a level
with large principal quantum number is a hydrogen-like object, termed
a Rydberg atom. The giant size of Rydberg atoms leads to huge
interaction effects. Monitoring these interactions has provided
insights into atomic and molecular physics on the single-quantum level.
Excitons, the fundamental optical excitations in semiconductors,
consisting of an electron and a positively charged hole, are the
condensed-matter analogues of hydrogen. Highly excited excitons
with extensions similar to those of Rydberg atoms are of interest
because they can be placed and moved in a crystal with high precision
using microscopic energy potential landscapes. The interaction of
such Rydberg excitons may allow the formation of ordered exciton
phases or the sensing of elementary excitations in their surroundings
on a quantum level. Here we demonstrate the existence of Rydberg
excitons in the copper oxide Cu₂O, with principal quantum numbers
as large as n = 25. These states have giant wavefunction extensions
(that is, the average distance between the electron and the hole) of
more than two micrometres, compared to about a nanometre for the
ground state. The strong dipole-dipole interaction between such
excitons is indicated by a blockade effect in which the presence of
one exciton prevents the excitation of another in its vicinity.

The Coulomb attraction between a negatively charged electron in
the conduction band and a positively charged hole in the valence band
leads to the formation of bound exciton states, which are essential to the
optical properties of semiconductors. The exciton in bulk crystals has
strong similarities with the hydrogen atom, which have been
substantiated by the observation of exciton states with binding energies
−Ry/n² below the bandgap. Here Ry is the Rydberg energy and n is an
integer that is analogous to the principal quantum number n of hydrogen.
The observation of highly excited excitons is typically prevented by a
small Rydberg energy of a few millielectronvolts (for example, 4.2 meV in
the prototypical semiconductor GaAs with n = 3 as its highest observed
state), which is about three orders of magnitude smaller than in
hydrogen owing to the small reduced mass of the electron and hole and
dielectric screening from the many-body surroundings. Therefore, higher
exciton states are energetically spaced too closely to each other and to
the ionization continuum to be resolvable.
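
To make the spacing argument concrete, the short sketch below evaluates the hydrogen-like binding energies −Ry/n² and the gap between neighbouring levels. The function names are illustrative, and the two Rydberg energies are simply the round numbers quoted in the text (4.2 meV for GaAs, roughly 100 meV for Cu₂O); nothing here is taken from the authors' analysis.

```python
def binding_energy_mev(rydberg_mev: float, n: int) -> float:
    """Hydrogen-like exciton level -Ry/n^2, in meV relative to the bandgap."""
    if n < 1:
        raise ValueError("principal quantum number must be >= 1")
    return -rydberg_mev / n**2


def level_spacing_mev(rydberg_mev: float, n: int) -> float:
    """Energy gap between levels n and n + 1, in meV."""
    return binding_energy_mev(rydberg_mev, n + 1) - binding_energy_mev(rydberg_mev, n)


if __name__ == "__main__":
    # Rydberg energies as quoted in the text (assumed round values).
    for label, rydberg in (("GaAs", 4.2), ("Cu2O", 100.0)):
        for n in (3, 10, 24):
            gap_uev = 1e3 * level_spacing_mev(rydberg, n)
            print(f"{label}: n={n:2d} -> n={n + 1:2d} spacing = {gap_uev:10.2f} micro-eV")
```

Because the spacing falls roughly as 2Ry/n³, a larger Rydberg energy directly widens every gap, which is why a material with Ry near 100 meV leaves far more room for resolving high-n states.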

In our search for highly excited excitons we have chosen the
semiconductor copper oxide (Cu₂O), in which excitons were first observed,
facilitated by the comparatively large Rydberg energy of around 100 meV.
Cu₂O has a direct bandgap (see Supplementary Information). The highest
valence and the lowest conduction bands are formed from Cu states,
the 3d and 4s orbitals, respectively. The excitons associated with these
two bands form the so-called yellow series with energies around 2.1 eV,
corresponding to a wavelength of 590 nm for excitation by light. Both
bands have the same parity, and therefore electric dipole transitions for
excitons with S-type envelope wavefunctions are forbidden. In contrast,
excitons with a P-envelope are dipole-allowed, as outlined in the
Supplementary Information. In early works, the P-exciton series could
be followed up to n = 9 and over the years has been extended to n = 12.
Going beyond these n numbers would allow us to create the solid-state
analogue to Rydberg atoms. Rydberg atoms have been intensively studied
recently because of their attractive properties (long lifetimes, strong
dipolar interactions, and so on), which might pave the way for quantum
information technologies. Very recently, coupling of the electron
in a Rydberg atom (with n = 202 and radius of 2 μm) to a Bose-Einstein
condensate was studied.

We studied the exciton spectrum in Cu₂O using high-resolution
spectroscopy in which the photon energy of a laser with 5-neV linewidth
(corresponding to about 1.2 MHz) was scanned across the energy range
of interest and the transmitted laser intensity was measured (see
Supplementary Information). Usually, the success of semiconductors is
based on extremely high crystal quality achieved by artificial fabrication.
Oddly, artificial Cu₂O crystals are inferior in quality compared to
natural crystals. We used a Cu₂O crystal with a thickness of 34 μm, cut
and polished from a rock mined at the Tsumeb mine in Namibia. The
sample was held at a temperature of 1.2 K (see Fig. 1b and c).
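
The linewidth conversion quoted above (5 neV, about 1.2 MHz) follows from f = E/h. A minimal check, assuming only the SI-defined values of the Planck constant and the elementary charge (the helper name is illustrative):

```python
# Planck constant in eV s, from the exact SI values of h (J s) and e (C).
PLANCK_EV_S = 6.62607015e-34 / 1.602176634e-19


def linewidth_hz(energy_ev: float) -> float:
    """Convert an energy width in eV to a frequency width in Hz via f = E / h."""
    return energy_ev / PLANCK_EV_S


if __name__ == "__main__":
    for width_nev in (5.0, 1.0):
        freq_khz = linewidth_hz(width_nev * 1e-9) / 1e3
        print(f"{width_nev:.0f} neV  ->  {freq_khz:.0f} kHz")
```

The same helper reproduces the second laser's figure given later in the text, where a 1-neV linewidth corresponds to roughly 250 kHz.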

In the top panel of Fig. 1a we present the absorption spectrum of
P-excitons (that is, excitons with a P envelope) obtained from the
transmission experiments, revealing a large number of lines. To take a
closer look at the high-energy part, we zoom into the spectrum with
increasing resolution (see the lower panels of Fig. 1a). The exciton lines
are labelled by the corresponding principal quantum numbers. Surprisingly,
we can uniquely identify states with n up to 25, much higher than
previously reported for a solid state system. Such a high n caused us to
investigate the extension of the exciton wavefunction. Using the
hydrogen relation, the average radius $\langle r_n \rangle$ of an orbital
with principal quantum number n and angular momentum l is given by
$\langle r_n \rangle = \tfrac{1}{2} a_B \left(3n^2 - l(l+1)\right)$,
where $a_B$ is the Bohr radius and l = 1 for P-states. The Bohr
radius for P-excitons is $a_B$ = 1.11 nm (ref. 9). For n = 25 we thus get
$\langle r_{25} \rangle$ = 1.04 μm, corresponding to a huge exciton
extension of more than 2 μm, about ten times the light wavelength
(see Fig. 1d).
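
The radius estimate can be reproduced directly from the hydrogen relation quoted above. The sketch below is only an arithmetic check; the function name and default arguments are illustrative, with the Bohr radius of 1.11 nm taken from the text.

```python
def mean_radius_nm(n: int, l: int = 1, bohr_radius_nm: float = 1.11) -> float:
    """Average orbital radius <r_n> = (a_B / 2) * (3 n^2 - l (l + 1)), in nm."""
    if not 0 <= l < n:
        raise ValueError("need 0 <= l < n")
    return 0.5 * bohr_radius_nm * (3 * n**2 - l * (l + 1))


if __name__ == "__main__":
    for n in (2, 12, 25):
        radius = mean_radius_nm(n)
        print(f"n = {n:2d}: <r> = {radius:8.1f} nm, extension ~ {2 * radius / 1000:.2f} um")
```

For n = 25 this gives about 1,040 nm, so the quoted extension of more than 2 μm corresponds to twice the mean radius.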

The absorption lines exhibit an asymmetry with a steeper slope on
the high-energy flank. The asymmetry is due to interference of a discrete
excitonic state and a continuum of states from interaction with optical
phonons. From the corresponding fits to each line, the exciton resonance
energies $E_n$ can be accurately determined. These energies, shown
in Fig. 2a as functions of n, follow the Rydberg formula
$E_n = E_g - Ry/n^2$ to a good approximation, as shown by the fit in
Fig. 2a. From the fit we obtain the bandgap energy $E_g$ = 2.17208 eV and
the Rydberg energy Ry = 92 meV. However, our high-resolution spectroscopy
reveals a slight deviation from the Rydberg series, which can be
incorporated by employing the concept of quantum defects (see
Supplementary Information) such that $E_n = E_g - Ry/(n - \delta_l)^2$,
with the P-exciton quantum defect $\delta_P$ = 0.23.

The linewidths $\Gamma_n$, shown in Fig. 2b, decrease with increasing n
down to a few micro-electronvolts. For principal quantum numbers below
n = 10, the lineshapes can be well described by Lorentzians, suggesting
homogeneous broadening. Here, the data are in accordance with an
inverse cubic law of n. For higher n, the lineshape becomes increasingly
Gaussian; see Fig. 1a. This indicates that the homogeneous broadening
is superimposed by crystal inhomogeneities that are captured by the
extended exciton wavefunctions. Yet the linewidth decreases with
principal quantum number so that for n = 24 it is as small as 3 μeV. The
homogeneous linewidth $\Gamma_n$ can be used to derive estimates for the
exciton lifetime $\tau_n$ in state n using the relation
$\tau_n \approx \hbar/\Gamma_n$, which for the inhomogeneous case gives a
lower limit for $\tau_n$. For the highest principal quantum numbers we
obtained nanosecond lifetimes.

Figure 1 | High-resolution absorption spectra of yellow P excitons in Cu₂O.
a, Spectra are measured with a single-frequency laser on a natural sample of
thickness 34 μm at 1.2 K. Peaks correspond to resonances with different
principal quantum number n. The panels below show close-ups of the areas
marked by rectangles in each panel above. b, Photograph of the natural Cu₂O
crystal from which samples of different size and crystal orientation were cut.
c, A large crystal and a thin crystal mounted strain-free in a brass holder.
d, Wavefunction of the P exciton with n = 25. To visualize the giant extension,
the corresponding light wavelength is shown as the period of the sine function.
The bar corresponds to the extension of 1,000 lattice constants.

These lifetimes seem surprisingly long, given that the huge wavefunction
extension may cause the exciton to be fragile as it is confronted
with multiple scattering possibilities in the crystal. For low excitation
powers, carrier-carrier scattering can be neglected. However, two
pathways remain for the P-state decay, apart from direct radiative
recombination. One is relaxation into lower-lying exciton states by
spontaneous emission of far-infrared photons with energies of a few tens
of millielectronvolts. Because the corresponding rate is proportional to
the third power of photon energy, spontaneous emission is strongly
suppressed for such transitions compared to the visible range and can be
neglected. The other pathway is relaxation by emission of optical phonons.
From the overlap between the initial and final state exciton wavefunctions,
the relaxation rate is expected to scale as 1/n³ (ref. 13), in accordance
with the experiment for the homogeneous contribution.

From the giant extension, huge Coulomb interaction effects are expected,
which we access by studying the transmission as a function of laser
excitation power, and therefore of exciton density. The area of each
absorption peak corresponds to the absorption strength and is determined
by the exciton oscillator strength. The peak areas are shown in
Fig. 2c as a function of n for a laser power of $P_L$ = 20 μW,
corresponding to an intensity of 6 μW mm⁻². We find that the peak area
scales as 1/n³, but only in the range up to n = 17. This dependence
confirms the theoretical analysis for isolated P excitons, from which one
expects for the exciton oscillator strength a behaviour proportional to
$(n^2 - 1)/n^5 \propto 1/n^3$ for large n. However, there are pronounced
deviations for n > 18 at the excitation power we used; the peak areas are
reduced by almost an order of magnitude compared to the expected values.
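
The scaling relations quoted in the preceding paragraphs (series energies with a quantum defect, the linewidth-to-lifetime estimate, and the isolated-exciton oscillator strength) can be collected in a short numerical sketch. It assumes that the fitted values quoted in the text ($E_g$ = 2.17208 eV, Ry = 92 meV, quantum defect 0.23) can be combined in a single formula, which the text does not state explicitly; all function names are illustrative.

```python
# Reduced Planck constant in eV s, from the exact SI values of hbar (J s) and e (C).
HBAR_EV_S = 1.054571817e-34 / 1.602176634e-19


def exciton_energy_ev(n: int, e_gap_ev: float = 2.17208,
                      rydberg_ev: float = 0.092,
                      quantum_defect: float = 0.23) -> float:
    """Resonance energy E_n = E_g - Ry / (n - delta)^2, in eV."""
    return e_gap_ev - rydberg_ev / (n - quantum_defect) ** 2


def lifetime_lower_bound_s(linewidth_ev: float) -> float:
    """Lifetime estimate tau ~ hbar / Gamma; a lower bound if Gamma is inhomogeneous."""
    return HBAR_EV_S / linewidth_ev


def relative_oscillator_strength(n: int) -> float:
    """Isolated P-exciton oscillator strength up to a constant: (n^2 - 1) / n^5."""
    return (n**2 - 1) / n**5


if __name__ == "__main__":
    for n in (12, 17, 25):
        print(f"n = {n:2d}: E_n = {exciton_energy_ev(n):.5f} eV, "
              f"f_n * n^3 = {relative_oscillator_strength(n) * n**3:.4f}")
    tau = lifetime_lower_bound_s(3e-6)  # 3 micro-eV linewidth quoted for n = 24
    print(f"tau(3 ueV) >= {tau * 1e9:.2f} ns")
```

The product $f_n n^3$ differs from 1 by only $1/n^2$, so the $1/n^3$ law is already a good approximation for isolated excitons well before n = 17.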

To explore the origin of this reduction in more detail, we measured
the peak area as a function of excitation intensity. Figure 3a shows
corresponding absorption spectra from n = 12 upwards. With increasing
power, the absorption lines continuously decrease, with the higher-lying
ones fading away first. The peak areas are plotted as a function of
excitation intensity in Fig. 3b, showing a drop starting from a
characteristic power level for each principal quantum number. The powers
at which the drop starts shift to lower excitation intensity with
increasing n. These results suggest that interaction effects between
excitons are responsible for the reduction of absorption and the increase
of transmission. For larger exciton sizes, the interaction effects begin
at smaller exciton densities.

To explain the reduction, we propose a dipole blockade effect similar
to the one observed for Rydberg atoms. The blockade arises from
the dipole-dipole interactions between Rydberg excitons, depending
strongly on their separation. If an exciton is created, the energy for
exciting another exciton nearby is shifted by the dipole interaction
energy, away from the narrow undisturbed absorption line. Thereby a dipole
blockade is established: resonant absorption and exciton creation are
no longer possible inside the blockade volume $V_{blockade}$ in which the
dipole interaction energy is larger than the absorption linewidth
$\Gamma_n$.

Figure 2 | Dependences of experimentally determined parameters of
P-exciton lines on principal quantum number n, revealing power-law
behaviour. a, Exciton binding energy: square symbols are the resonance
energies $E_n$, the solid line represents the n⁻² dependence expected from
the Rydberg formula with Ry = 92 meV, and bandgap energy $E_g$ = 2.17208 eV.
Uncertainty of fitting of the exciton energy is negligible in the scale of
the plot, as shown by small vertical error bars. b, Square symbols are the
experimental absorption linewidth data (defined as FWHM, full width at half
maximum) and the solid line shows the n⁻³ dependence. c, Square symbols give
experimental oscillator strength (peak area) data in arbitrary units and the
solid line shows the n⁻³ dependence expected for a single non-interacting
exciton.

©2014 Macmillan Publishers Limited. All rights reserved

[Figure 3 plot; recoverable axis labels: Optical density, Energy (eV),
Laser intensity (mW mm⁻²), n.]

Figure 3 | Reduction of excitonic absorption due to dipole blockade.
a, Absorption spectra measured with different laser intensities. We note the
quenching of the high-n resonances when applying stronger laser excitation.
b, Dependence of oscillator strength (peak area) on laser power for different n
resonances; solid lines show fits to the data. c, Dependence of
blockade efficiency $S_n$ on laser power; solid line shows the fit according
to n¹⁰ dependence. d, Absorption spectra in the two-beam experiment. The
curves presented were measured with different powers of the pump laser and
constant probe laser power. The energy of the pump laser was fixed at n = 14.
e, Change of absorption at n = 17 resonance in the two-beam experiment as a
function of the pump laser photon energy (pump intensity 0.3 mW mm⁻²). The
red line is a guide for the eye. The grey line presents a single-beam
absorption spectrum in this energy range, proving that the absorption is
quenched more strongly when the pump laser is tuned to an exciton resonance.
We note the slight line shifts between the resonances in the two spectra
arising from exciton-exciton interaction. Owing to the variation of the
exciton separation in the laser spot, these shifts cannot be assessed
quantitatively, for which one would have to control the exciton position.
f, Influence of a magnetic field on the Rydberg blockade. Shown are the
optical densities at B = 0 T and B = 0.8 T as functions of excitation
intensity.


As a consequence, the absorption $\alpha$ at a given exciton density
$\rho_X$ in the illuminated crystal volume is reduced by a factor
$(1 - \rho_X V_{blockade})$ compared to the absorption
$\alpha_0(\hbar\omega)$ of the unexcited crystal:
$\alpha(P_L, \hbar\omega) = \alpha_0(\hbar\omega)\,(1 - \rho_X V_{blockade})$.
Vice versa, the exciton density $\rho_X$ is determined by this absorption
$\alpha$ times the laser power $P_L$, deposited within the exciton lifetime
$\tau_n \propto 1/\Gamma_n$, in the crystal:
$\rho_X \propto P_L \alpha/\Gamma_n$. Inserting this relation for the
exciton density and solving for the absorption $\alpha$ allows us to derive
the following scaling law for the dependence on laser power:

$\alpha(P_L, \hbar\omega) = \frac{\alpha_0(\hbar\omega)}{1 + S_n P_L}$    (1)

where $S_n$ describes the efficiency with which the absorption at the
energy of exciton state n is blocked through the presence of excitons in the
same state. Equation (1) describes the observed dependencies well; see
Fig. 3b. By fitting the experimental data for the peak area, we can extract
the $S_n$ values that are shown from n = 12 up to 24 in Fig. 3c. In this
high-n range, $S_n$ varies enormously with the principal quantum number,
increasing by more than three orders of magnitude. By fitting the data
with a power function, we find a dependence on the tenth power of n.
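
The step from the self-consistency condition to equation (1) can be checked numerically. In the sketch below the proportionality constant between exciton density and $P_L \alpha/\Gamma_n$ is absorbed into $S_n$ (so that $\rho_X V_{blockade} = (S_n/\alpha_0) P_L \alpha$), which is an assumption made for illustration; the bisection solver is likewise just one convenient way to solve the condition, not the authors' procedure.

```python
def absorption_closed_form(alpha0: float, s_n: float, p_laser: float) -> float:
    """Equation (1): alpha = alpha0 / (1 + S_n * P_L)."""
    return alpha0 / (1.0 + s_n * p_laser)


def absorption_self_consistent(alpha0: float, s_n: float, p_laser: float,
                               iterations: int = 200) -> float:
    """Solve alpha = alpha0 * (1 - rho_X * V_blockade) by bisection, with the
    blocked fraction rho_X * V_blockade written as (S_n / alpha0) * P_L * alpha."""
    def residual(alpha: float) -> float:
        blocked_fraction = (s_n / alpha0) * p_laser * alpha
        return alpha - alpha0 * (1.0 - blocked_fraction)

    low, high = 0.0, alpha0  # residual(low) < 0 <= residual(high)
    for _ in range(iterations):
        mid = 0.5 * (low + high)
        if residual(mid) > 0.0:
            high = mid
        else:
            low = mid
    return 0.5 * (low + high)


if __name__ == "__main__":
    alpha0, s_n = 1.0, 3.0  # arbitrary illustrative values
    for p_laser in (0.0, 0.1, 1.0, 10.0):
        print(p_laser,
              absorption_closed_form(alpha0, s_n, p_laser),
              absorption_self_consistent(alpha0, s_n, p_laser))
```

In this form $1/S_n$ is the laser power at which the absorption has dropped to half of its low-power value, which gives a direct reading of the characteristic power level seen in Fig. 3b.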

To understand the strong n-dependence, one has to consider possible
dipole-dipole interaction mechanisms. At large separations they can be
modelled by a van der Waals interaction energy
$E_{vdW}(n) = -\frac{C_6(n)}{R^6}$,
where R is the distance between two P-excitons in state n. For smaller
distances the interaction becomes resonant and is better described by a
Förster-type dependence $E_F(n) = -\frac{C_3(n)}{R^3}$. One finds that
$C_6$ varies with the principal quantum number as n¹¹, while $C_3$ scales
as n⁴ (for details, see Supplementary Information). The onset criterion for
the blockade of the dipole interaction energy becoming larger than the
absorption linewidth leads to a critical blockade radius
$R_c(n) = \sqrt[\alpha]{C_\alpha(n)/\Gamma_n}$ (with $\alpha$ = 6 for the
van der Waals and $\alpha$ = 3 for the Förster case).
From this radius the blockade volume $V_{blockade} = \tfrac{4}{3}\pi R_c^3$
can be derived.

Taking the above considerations into account, we conclude that the
dependence of the blockade efficiency $S_n$ on the principal quantum
number n is determined by the product of blockade volume $V_{blockade}$
times the exciton lifetime $\tau_n \propto 1/\Gamma_n$. From
$V_{blockade} \propto R_c^3 \propto n^7$ for both mechanisms of dipolar
interaction and $\Gamma_n \propto n^{-3}$ we obtain an extremely steep
increase in the blockade efficiency $S_n$ with increasing principal quantum
number, given by its tenth power:
$S_n \propto V_{blockade}/\Gamma_n \propto n^{10}$, in excellent agreement
with experiment.
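
The exponent bookkeeping behind the n¹⁰ law can be verified with exact rational arithmetic. The sketch below takes the scalings stated in the text ($C_6 \propto n^{11}$, $C_3 \propto n^{4}$, $\Gamma_n \propto n^{-3}$) as inputs; the function name and return layout are illustrative.

```python
from fractions import Fraction


def blockade_exponents(c_exponent: int, alpha: int, linewidth_exponent: int = -3):
    """Return the powers of n for (R_c, V_blockade, S_n).

    R_c = (C_alpha / Gamma)^(1/alpha), V_blockade ~ R_c^3, S_n ~ V_blockade / Gamma.
    """
    radius = Fraction(c_exponent - linewidth_exponent, alpha)
    volume = 3 * radius
    efficiency = volume - linewidth_exponent
    return radius, volume, efficiency


if __name__ == "__main__":
    for label, c_exp, alpha in (("van der Waals", 11, 6), ("Foerster", 4, 3)):
        radius, volume, efficiency = blockade_exponents(c_exp, alpha)
        print(f"{label:14s}: R_c ~ n^{radius}, V ~ n^{volume}, S_n ~ n^{efficiency}")
```

Both mechanisms give $R_c \propto n^{7/3}$, which is why the same n⁷ volume scaling, and hence the same n¹⁰ efficiency, results in either regime.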


We expect the Coulomb blockade to occur not only for excitons
with the same n, but also for different n. In effect, the experiment thus
far resembled a single-beam pump-probe experiment with degenerate
pump and probe. To test our suggestion further, we implemented another
tunable laser with a linewidth of 1 neV (about 250 kHz in frequency)
so that we could vary the pump and probe photon energies independently.
At first, we kept the photon energy of the pump laser fixed at
the n = 14 exciton, while simultaneously sampling the exciton spectrum
with the probe laser (see Fig. 3d). With increasing pump intensity the
transmission at all exciton lines increases, accompanied by some line
broadening. This demonstrates that the blockade works also for
off-resonant excitons. Furthermore, at a fixed excitation intensity,
excitons with high principal quantum numbers are more strongly blocked by
the off-resonant pump than lower-lying ones. Within the excitation spot,
the separation between excitons varies, contributing to a broad absorption
background in the two-colour studies owing to the widely varying
exciton interaction energies. The remaining absorption line arises from
excitons with interaction energies below the linewidth.

In a next step, the pump was scanned, and the probe photon energy 
was kept at the n = 17 exciton energy, for which we monitor the change 
of absorption. The goal was to detect a reduced absorption and hence 
increased transmission whenever the pump photon energy hits an ex- 
citon resonance. To demonstrate the effectiveness of the blockade, the 
pump photon energy is varied from n = 6 to n = 10 corresponding to 
Bohr radii up to 100 nm. Whenever the pump laser hits a narrow exciton 
resonance in this energy range the transmission of the probe laser in- 
creases, as demonstrated in Fig. 3e. This proves that at the pump den- 
sities we used no electron-hole plasma is created, but excitons stay 
intact and prevent exciton creation in the n = 17 state. 

Since the efficiency with which an exciton blocks exciton creation in
its vicinity is determined by its extension, we corroborate our
interpretation by deliberately varying the exciton size. An efficient tool
for such size variation is the application of a magnetic field, in our case
along the optical axis (taken as the z-axis). The magnetic field leads to a
complex splitting of exciton levels with different magnetic quantum numbers
(the Zeeman effect), which we do not discuss in detail here. Instead, we
focus on the Rydberg blockade-related wavefunction engineering. The
magnetic field adds parabolic confinement potentials for the electron
and the hole to the exciton Hamiltonian,
$\frac{e^2 B^2}{8m}(x^2 + y^2)$ (with m the mass of the respective
carrier), whose strength is controlled by the field strength B. As a
result, the exciton wavefunction is squeezed normal to the field. The
characteristic length scale for the magnetic confinement is the magnetic
length $\ell_c = \left(\frac{\hbar}{eB}\right)^{1/2}$, which
for a field strength of 1 T is 25.65 nm; this emphasizes the strong
impact of the magnetic field on the Rydberg exciton extension.
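
The magnetic length quoted above is easy to reproduce from $\ell_c = (\hbar/eB)^{1/2}$. The following check uses the SI-defined constants; the function name is illustrative, and the 0.8 T value is included only because that field is used in the measurement described next.

```python
import math

HBAR_J_S = 1.054571817e-34            # reduced Planck constant (J s)
ELEMENTARY_CHARGE_C = 1.602176634e-19  # elementary charge (C)


def magnetic_length_nm(b_tesla: float) -> float:
    """Magnetic length l_c = sqrt(hbar / (e B)), returned in nanometres."""
    if b_tesla <= 0.0:
        raise ValueError("field strength must be positive")
    return math.sqrt(HBAR_J_S / (ELEMENTARY_CHARGE_C * b_tesla)) * 1e9


if __name__ == "__main__":
    for field in (0.8, 1.0):
        print(f"B = {field:.1f} T: l_c = {magnetic_length_nm(field):.2f} nm")
```

Since the mean radius of a high-n P exciton estimated earlier is several hundred nanometres, a confinement length of a few tens of nanometres is far smaller than the unperturbed state, consistent with the strong squeezing described in the text.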

Figure 3f shows the absorption of the n = 15 exciton, recorded at
B = 0 and 0.8 T versus the optical excitation intensity. The shift of the
n = 15 exciton in magnetic field was carefully monitored, so that the
excitation laser could be kept in resonance with the exciton. Although
in the absence of a magnetic field the exciton becomes strongly bleached
and can hardly be observed for laser intensities exceeding 5 mW mm⁻²,
under a magnetic field the drop of absorption with increasing laser power
is much weaker, so that even at the highest applied laser intensity of
20 mW mm⁻² the absorption is about half that at low intensity. This
finding is in accord with the expected effect of wavefunction squeezing
on the Rydberg blockade.

We believe that the observation of Rydberg excitons opens up a new
field in condensed-matter spectroscopy. The blockade may be exploited
in applications such as nonlocal all-optical switching or mesoscopic
single-photon devices. It will be interesting to work out the similarities
and differences between Rydberg atoms and Rydberg excitons. The
exciton Bohr radius, for example, is much larger for similar principal
quantum numbers, so that comparable blockade volumes may be reached for
excitons at considerably smaller n than for atoms. In addition, the light
wavelength is shrunk in the Cu₂O crystal by the refractive index of 3,
so that Rydberg excitons permit testing of light-matter interaction
descriptions, such as the electric-dipole approximation. Differences may
also show up in studies of interaction effects among excitons, which may
be deliberately induced and controlled by exciting additional excitons
in particular n states. The relaxation of excitons by phonons may lead
to the formation of low-n exciton populations with which the Rydberg
excitons can interact. One of these excitons, the paraexciton, is a prime
candidate for Bose-Einstein condensation in Cu₂O (ref. 15), so that
recent experiments in atomic physics could be mimicked by studying
the interaction of a Rydberg exciton with such a condensate.

Despite a long-standing discussion, molecules formed from two
excitons have not previously been demonstrated in Cu₂O. Rydberg
excitons open up new perspectives owing to their strong dipole-dipole
interactions. Exciton molecules with varying constituents and tunable
binding energies could be excited. The number of excitons forming
a molecule may be varied to form large cluster-like states or extended
condensed phases.

The crystal environment means that Rydberg excitons could permit
studies that are not possible in atomic physics. For example, the
position of individual Rydberg excitons might be accurately controlled by
applying spatially modulated strain fields to the crystal. Also, additional
electric or magnetic fields may be applied, with which the interaction
between Rydberg excitons and their stability could be dynamically
controlled. Rydberg atoms subjected to high magnetic fields mimic
hydrogen atoms (ref. 22 and references therein) in white dwarf stars. Such
a hydrogen-like system in a strong magnetic field represents a
non-integrable problem leading to chaotic behaviour. Rydberg excitons have
smaller Rydberg energies, so they should enter a similar regime at very
low magnetic fields, which are easier to study.

The ability to store energy on the electric grid would greatly improve
its efficiency and reliability while enabling the integration of
intermittent renewable energy technologies (such as wind and solar) into
baseload supply. Batteries have long been considered strong candidate
solutions owing to their small spatial footprint, mechanical
simplicity and flexibility in siting. However, the barrier to widespread
adoption of batteries is their high cost. Here we describe a
lithium-antimony-lead liquid metal battery that potentially meets the
performance specifications for stationary energy storage applications.
This Li||Sb-Pb battery comprises a liquid lithium negative electrode,
a molten salt electrolyte, and a liquid antimony-lead alloy positive
electrode, which self-segregate by density into three distinct layers
owing to the immiscibility of the contiguous salt and metal phases.
The all-liquid construction confers the advantages of higher current
density, longer cycle life and simpler manufacturing of large-scale
storage systems (because no membranes or separators are involved)
relative to those of conventional batteries. At charge-discharge
current densities of 275 milliamperes per square centimetre, the cells
cycled at 450 degrees Celsius with 98 per cent Coulombic efficiency
and 73 per cent round-trip energy efficiency. To provide evidence of
their high power capability, the cells were discharged and charged at
current densities as high as 1,000 milliamperes per square centimetre.
Measured capacity loss after operation for 1,800 hours (more than
450 charge-discharge cycles at 100 per cent depth of discharge)
projects retention of over 85 per cent of initial capacity after ten
years of daily cycling. Our results demonstrate that alloying a
high-melting-point, high-voltage metal (antimony) with a
low-melting-point, low-cost metal (lead) advantageously decreases the
operating temperature while maintaining a high cell voltage. Apart from
the fact that this finding puts us on a desirable cost trajectory, this
approach may well be more broadly applicable to other battery chemistries.

Among metalloids and semi-metals, Sb stands as a promising
positive-electrode candidate for its low cost (US$1.23 mol⁻¹) and
relatively high cell voltage when coupled with an alkali or alkaline-earth
negative electrode. In previous work, we demonstrated the performance
of a Mg||Sb liquid metal battery at current densities ranging from
50 to 200 mA cm⁻², achieving a round-trip energy efficiency of up to 69%.
However, the high melting points of Mg (T_m = 650 °C) and Sb (T_m =
631 °C) require the cell to operate near 700 °C. A high operating
temperature is undesirable because it results in higher rates of corrosion
and detracts from overall storage efficiency, which ultimately increases
cost of ownership. These potential limitations, in conjunction with an
estimated electrode materials cost of US$375 kWh⁻¹ and an average cell
voltage of 0.21 V (measured under galvanostatic discharge at 200 mA cm⁻²),
render Mg||Sb cells impractical for commercial applications.

With an average cell voltage of 0.92 V, the Li||Sb combination is an
appealing alternative. Moreover, Li melts at 180 °C and exhibits low
solubility in lithium halide melts, which results in lower self-discharge
current and, hence, higher energy efficiency, especially when compared
with sodium alternatives. Despite these attractive properties, the high
melting point of Sb sets the operating temperature of the Li||Sb cell at
almost 500 °C above the melting point of Li. Although alloying Sb with
another metal can be an effective strategy to lower the melting point of
the positive electrode, this is generally accompanied by an undesirable
decrease in cell voltage, as observed in the Mg||Sn-Sb (ref. 9) and
Na||Bi-Sb (ref. 10) systems.

Here we report that, with the use of a Li negative electrode, the
addition of Pb to Sb maintains a cell voltage almost as great as that for
pure Sb while substantially reducing the melting temperature (eutectic
composition, Sb-Pb 18:82 mol%; melting temperature, T_m = 253 °C; ref. 11).
Figure 1 shows the equilibrium voltage of the Li||Sb-Pb cell as a
function of Li concentration in various Sb-Pb alloys. Measurements were
made by coulometric titration in a LiF-LiCl-LiI molten salt electrolyte
(20:50:30 mol%, T_m = 430 °C). Four compositions of Sb-Pb alloys are
shown here: 50:50, 40:60, 30:70 and 18:82 mol% (eutectic composition).
The corresponding equilibrium voltages of the binary Li||Sb cell and
the binary Li||Pb cell are also shown for comparison. At the chosen
experimental temperature (450 °C), Sb-Pb alloys and pure Pb are liquid,
whereas Sb is solid. As shown in Fig. 1, the Li||Sb (solid) chemistry has
the highest cell voltage, at 0.92 V, and the Li||Pb chemistry exhibits the
lowest cell voltage, just under 0.6 V. We note that even at high dilution
(up to 82 mol% Pb in Sb), the Li||Sb-Pb systems operate at cell voltages
very near Li||Sb levels (only about 0.05 V lower), indicating that the
Li||Sb-Pb electrode potential is determined primarily by the Li-Sb
interaction. To reveal the predominant role of Sb in setting the potential,
Fig. 1b was normalized to the concentration of Li relative to Sb. Figure 1b
shows that all Li-Sb-Pb electrodes share a behaviour similar to that of
the Li-Sb electrode: in each case, a region of near-constant, high
potential is followed by a drop in potential at 75 mol% Li relative to Sb.
This behaviour is suggestive of the Li||Sb system in which Li_xSb (x < 3)
compounds are formed and thereby generate a high electrode potential
relative to pure Li. When the concentration of the alloy reaches 75 mol% Li
in Sb, a low-potential Li₃Sb phase is formed, and the cell voltage drops
precipitously.

Figure 1 | Electromotive force of Li-Sb-Pb electrodes measured by
coulometric titration at 450 °C. Electromotive force (EMF) as a function of
Li concentration in Sb-Pb alloys (a), and as a function of Li concentration
normalized with respect to Sb (b). Pure Sb data are from ref. 7.

Figure 2 | Performance of a Li||Sb-Pb cell cycled at 275 mA cm⁻². a, Profiles
of voltage and current density during charge-discharge (15th cycle). The results
derive from measurements on more than 10 cells. b, Coulombic efficiency,
energy efficiency and discharge capacity density as functions of cycle number.
The theoretical cell capacity was 1.9 Ah with a fully discharged target
composition of 45% Li in an Sb-Pb 30:70 mol% alloy (3.16 cm² of active surface
area). The operating temperature was 450 °C. The results derive from
measurements on more than two cells.

Cell performance was demonstrated in 1.9 Ah theoretical capacity cells
(3.16 cm² positive-electrode/electrolyte interfacial area) fitted with a
Li negative electrode, an Sb-Pb positive electrode (30:70 mol%) and a
LiF-LiCl-LiI molten salt electrolyte (20:50:30 mol%, T_m = 430 °C). Cells
were assembled in the fully charged state in an Ar-filled glove box, placed
inside a sealed test vessel and operated in a vertical tube furnace at 450 °C.
When the temperature exceeded the melting point of the salt, the
equilibrium cell voltage stabilized at ~1.0 V, consistent with titration
results. At a stepped potential of 1.2 V, the self-discharge current was
measured to be 0.6 mA cm⁻², which is significantly lower than the value
observed in Na systems (20 mA cm⁻²). This is attributable to the lower
solubility of Li in its molten halides. A typical charge-discharge voltage
profile and the usual performance metrics as functions of cycle index are
shown in Fig. 2. At a high current density of 275 mA cm⁻², cells
consistently achieved 93% of their theoretical capacity. The nominal
discharge voltage was 0.73 V, which is more than three times higher than
that of Mg||Sb. On the basis of measured cell performance, the electrode
materials costs are estimated to be US$68 kWh⁻¹, which is about one-fifth
of the value for Mg||Sb cells.

For stationary applications, long service lifetime is a critical factor. 
Liquid metal batteries are advantageous owing to the liquid electrodes 
and molten salt electrolyte, which avoid many of the common failure 
mechanisms associated with batteries fitted with solid-state electrodes, 
for example undesirable film formation at the electrode-electrolyte in- 
terface, or phase transformations that mechanically damage cell com- 
ponents’*. Here we show the results of charge-discharge cycle testing 
of the Li||Sb-Pb system (Fig. 2b). Over the duration of the test, the cells 
exhibited a Coulombic efficiency of 98% and a round-trip energy effi- 
ciency of 73%, maintaining 94% of the initial capacity after 450 cycles 
at full depth of discharge. The capacity fade rate decreased after the 100th 
cycle, whereupon the fade rate between the 100th and 450th cycles was 
0.004% per cycle. This is equivalent to retention of over 85% of the initial 
capacity after ten years of daily cycling. Cross-sections of cells after 
1,800 hours of operation did not exhibit any obvious corrosion of cell 
components (current collectors and walls). 
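
The ten-year retention figure quoted above can be checked with simple arithmetic. The following is a minimal sketch, assuming one full cycle per day, 365 cycles per year, and that the 0.004% per cycle fade rate measured between the 100th and 450th cycles continues unchanged; the function name and the choice to show both a linear and a compounding extrapolation are illustrative assumptions, not part of the reported analysis.

```python
def retained_capacity(fade_pct_per_cycle, cycles, compound=False):
    """Fraction of the initial capacity remaining after `cycles` cycles.

    compound=False subtracts a fixed fraction of the initial capacity each cycle;
    compound=True removes the same fraction of the remaining capacity each cycle.
    """
    fade = fade_pct_per_cycle / 100.0
    if compound:
        return (1.0 - fade) ** cycles
    return 1.0 - fade * cycles


# Assumption: daily cycling for ten years, ignoring leap days.
cycles_10y = 10 * 365
linear = retained_capacity(0.004, cycles_10y)
compounded = retained_capacity(0.004, cycles_10y, compound=True)
print(f"linear: {linear:.3f}, compounded: {compounded:.3f}")
```

Under either extrapolation the retained fraction stays above 0.85, in line with the statement in the text.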

The ability to operate at high current densities with minimal impact 
on cycle life is an asset for certain grid applications such as ancillary services.
Here we show the capability of Li||Sb-Pb cells with a LiF-LiCl-LiI
salt electrolyte (20:50:30 mol%, Tm = 430 °C) operating at current
densities as high as 1,000 mA cm⁻² (Fig. 3) not only while discharging,
but also while charging. The latter is especially useful in applications such
as frequency regulation. At current densities as high as 500 mA cm⁻²,

Figure 3 | Voltage profiles during charge-discharge at different current
densities (100-1,000 mA cm⁻²) of a Li||Sb-Pb cell. The theoretical capacity
was 1.33 Ah with a fully discharged target composition of 45% Li in an Sb-Pb
30:70 mol% alloy (2.0 cm² of active surface area). The operating temperature
was 450 °C. The results derive from measurements on more than three cells.

Figure 4 | Performance of a Li||Sb-Pb cell cycled at 275 mA cm⁻². a, Profiles
of voltage and current density during charge-discharge (15th cycle).
b, Coulombic efficiency, energy efficiency and discharge capacity density as
functions of cycle number. The theoretical cell capacity was 62 Ah with a fully
discharged target composition of 52.4% Li in an Sb-Pb 40:60 mol% alloy
(62 cm² of active surface area). The operating temperature was 500 °C.

there is no significant decrease in the reversible capacity (87% of the 
theoretical value). Even at the highest current density (1,000 mA cm⁻²),
the cell performed at 54% of its theoretical capacity. Most noteworthy 
about this last observation is the ability of the cell to act as a high-current 
load without incurring permanent damage: efficiency in this instance 
is subordinate to long-term electrode stability. This advantageous mix 
of features is attributable to the rare combination of the high conduc- 
tivity of the molten salt electrolyte, ultrafast charge-transfer kinetics at 
the electrode-electrolyte interface between the liquid metal and mol- 
ten salt, and fast mass transport within the liquid metal electrodes. 
To demonstrate the scalability of the system, cells with a 62 Ah theoretical
capacity (62 cm²) were constructed and operated with performance
similar to that achieved on the smaller scale (1.9 Ah). To optimize
system costs, a LiF-LiCl-LiBr eutectic electrolyte (22:31:47 mol%,
Tm = 443 °C) and a positive-electrode Sb-Pb composition of 40:60 mol%
were chosen. Operating at a temperature of 500 °C, the cells were cycled
at 275 mA cm⁻². Figure 4 shows a typical charge-discharge voltage profile
and the cell performance metrics over twenty cycles. The Coulombic
efficiency averaged 98% and the round-trip energy efficiency 71%, and
negligible capacity fade was observed over the duration of the
test. The nominal discharge voltage was measured to be ~0.69 V. On
this basis, the electrode materials costs were estimated to be US$65 kWh⁻¹.

Alloying Sb with Pb has been identified as a way to achieve signifi- 
cant reductions in the melting point of the positive electrode as well as 
the cell operating temperature without an attendant decrease in cell volt- 
age. This finding not only lowers the cost of Li||Sb-Pb batteries, increas- 
ing their attractiveness for stationary applications, but also serves as an 
example of how to broaden the selection of positive-electrode materi- 
als for liquid metal batteries. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS


For all experiments, high purity (>99.9%), ultradry-grade LiF, LiCl, LiBr and Lil salts 
(Alfa Aesar) were used in electrolytes. Salt mixtures were dried under vacuum at 80 °C 
for 8 h and at 250 °C for 2 h to remove residual water, and were then melted under Ar
gas at 600 °C. All experiments were performed under a high-purity Ar atmosphere. 

For the evaluation of equilibrium voltage by coulometric titration, all alloy target 
compositions were prepared using an arc-melter (MAM1, Edmund Bühler GmbH).
After arc-melting, alloys were placed into a small mullite basket, pre-melted and 
used as working electrodes. Electrical contact was established using a tungsten wire 
(1 mm diameter) immersed in the Sb-Pb alloy. Ag/AgCl served as the reference electrode
(3 wt% AgCl in a LiCl-NaCl-KCl eutectic mixture) and was contained within
a closed-end mullite tube with the end polished to a thin microporous layer. A 
40:60 mol% Li-Al alloy was used as the counter-electrode. Electrochemical measure- 
ments were performed with an Autolab PGSTAT 302N potentiostat/galvanostat. 

For cell testing, galvanostatic charge and discharge were performed using an Arbin 
BT2000. Alloys were pre-weighed and placed in cell containers. The salt mixtures 
were introduced and dried in situ under vacuum at 80 °C for 8 h and 250 °C for 4 h
before setting the operating temperature for electrochemical testing. A cell schematic
of the Li||Sb-Pb liquid metal battery is shown in Extended Data Fig. 1.

All electrode cost estimations were performed using the following formula:
\( C = \sum_i P_i m_i / E \), where \(C\) is the capital cost per unit of discharged
electrical energy in US$ kWh⁻¹, \(P_i\) is the specific bulk metal cost in US$ kg⁻¹,
\(m_i\) is the metal's mass in kg, and \(E\) is the discharged energy in kWh. For
Li||Sb-Pb cells, the value of discharged energy, \(E\), was that measured during
galvanostatic cycling at 275 mA cm⁻² (Figs 2 and 4). For Mg||Sb cells, the
discharged energy, \(E\), was that measured during galvanostatic cycling at
200 mA cm⁻² (ref. 6). Average bulk metal prices of Li (US$61.7 kg⁻¹),
Mg (US$2.7 kg⁻¹), Pb (US$2.1 kg⁻¹) and Sb (US$10.1 kg⁻¹) were obtained from the
MetalPrices Online Database (http://www.metalprices.com literature). Detailed cost
calculations for the cells are presented in Extended Data Tables 1-3. A balance 
of system and salt costs is not included because the technology has yet to be fully 
developed on the commercial scale, and so there is no accurate basis for such 
estimation. 
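
The cost formula can be illustrated with the values listed for the 1.9 Ah cell in Extended Data Table 2. This is a minimal sketch rather than the authors' calculation: the function and variable names are hypothetical, the prices are the per-kilogram values from that table, and because the tabulated masses are rounded to two decimals the result only approximately matches the tabulated costs.

```python
# Bulk metal prices in US$ per kg, as listed in Extended Data Table 2.
PRICE_USD_PER_KG = {"Li": 61.70, "Pb": 2.10, "Sb": 10.09}


def electrode_cost_per_kwh(masses_g, energy_kwh, prices=PRICE_USD_PER_KG):
    """Evaluate C = sum_i(P_i * m_i) / E.

    masses_g: metal masses in grams (converted to kg below).
    energy_kwh: measured discharge energy in kWh.
    Returns (cost per metal, total cost), both in US$ per kWh.
    """
    per_metal = {
        metal: prices[metal] * (mass_g / 1000.0) / energy_kwh
        for metal, mass_g in masses_g.items()
    }
    return per_metal, sum(per_metal.values())


# Masses and average discharge energy listed for the 1.9 Ah cell.
small_cell_masses_g = {"Li": 0.49, "Pb": 12.57, "Sb": 3.17}
per_metal, total = electrode_cost_per_kwh(small_cell_masses_g, energy_kwh=1.30e-3)
print({metal: round(cost, 1) for metal, cost in per_metal.items()}, round(total, 1))
```

The total comes out close to the US$68 kWh⁻¹ quoted in the main text; the small difference from the tabulated 68.37 is consistent with rounding of the listed masses and energy.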

Extended Data Figure 1 | Cell schematic of Li||Sb-Pb liquid metal battery.
The negative current collector consists of a stainless steel rod and Fe-Ni foam.
The positive current collector is made of graphite (small cell; 3.16 cm² active
area) or 304 stainless steel (large cell; 62 cm² active area). Current collectors are
electrically isolated by means of an alumina insulator.

Extended Data Table 1 | Cost calculation of Mg||Sb 2.5 Ah cell

Extended Data Table 2 | Cost calculation of Li||Sb-Pb 1.9 Ah cell

                            Li       Pb        Sb        Total
m / g                       0.49     12.57     3.17      16.22
n / mol                     0.07     0.06      0.03
M / g mol⁻¹                 6.94     207.20    121.76
P / $ kg⁻¹                  61.70    2.10      10.09
Pmolar / $ mol⁻¹            0.43     0.44      1.23
C / $ kWh⁻¹                 23.40    20.34     24.62     68.37
Energy density / Wh kg⁻¹                                 79.96

Capacity / Ah                        1.90
nLi/(nLi+nPb+nSb)                    0.45
nSb/(nSb+nPb)                        0.30
nLi/(nLi+nSb)                        0.73

Average discharge energy / kWh       1.30E-03
F / C mol⁻¹                          96485.40

Extended Data Table 3 | Cost calculation of Li||Sb-Pb 62 Ah cell

                            Li       Pb        Sb        Total
m / g                       16.00    259.80    101.70    377.50
n / mol                     2.31     1.25      0.84
M / g mol⁻¹                 6.94     207.20    121.76
P / $ kg⁻¹                  61.70    2.10      10.09
Pmolar / $ mol⁻¹            0.43     0.44      1.23
C / $ kWh⁻¹                 25.24    13.95     26.23     65.41
Energy density / Wh kg⁻¹                                 103.63

Capacity / Ah                        61.78
nLi/(nLi+nPb+nSb)                    0.52
nSb/(nSb+nPb)                        0.40
nLi/(nLi+nSb)                        0.73

Average discharge energy / kWh       3.91E-02
F / C mol⁻¹                          96485.40
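
The capacity entries in the two cost tables appear to be consistent with Faraday's law applied to the listed Li mass. The sketch below assumes one electron transferred per Li atom and uses the molar mass and Faraday constant given in the tables; the function name is hypothetical and this reading of how the capacities were obtained is an inference, not something the Methods state.

```python
FARADAY_C_PER_MOL = 96485.40        # value listed in the tables
LI_MOLAR_MASS_G_PER_MOL = 6.94      # value listed in the tables


def theoretical_capacity_ah(li_mass_g, electrons_per_li=1):
    """Charge stored by the Li inventory, in ampere-hours (1 Ah = 3600 C)."""
    moles_li = li_mass_g / LI_MOLAR_MASS_G_PER_MOL
    coulombs = moles_li * electrons_per_li * FARADAY_C_PER_MOL
    return coulombs / 3600.0


small_cell_ah = theoretical_capacity_ah(0.49)    # Li mass from the 1.9 Ah table
large_cell_ah = theoretical_capacity_ah(16.00)   # Li mass from the 62 Ah table
print(f"{small_cell_ah:.2f} Ah, {large_cell_ah:.2f} Ah")
```

Both values fall within rounding of the tabulated capacities (1.90 Ah and 61.78 Ah).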


©2014 Macmillan Publishers Limited. All rights reserved 




doi:10.1038/nature13767 


High winter ozone pollution from carbonyl 
photolysis in an oil and gas basin 

The United States is now experiencing the most rapid expansion in
oil and gas production in four decades, owing in large part to imple- 
mentation of new extraction technologies such as horizontal dril- 
ling combined with hydraulic fracturing. The environmental impacts 
of this development, from its effect on water quality’ to the influence 
of increased methane leakage on climate’, have been a matter of intense 
debate. Air quality impacts are associated with emissions of nitrogen
oxides** (NOx = NO + NO2) and volatile organic compounds”
(VOCs), whose photochemistry leads to production of ozone, a secondary
pollutant with negative health effects®. Recent observations
in oil- and gas-producing basins in the western United States have 
identified ozone mixing ratios well in excess of present air quality 
standards, but only during winter”"’*. Understanding winter ozone 
production in these regions is scientifically challenging. It occurs 
during cold periods of snow cover when meteorological inversions 
concentrate air pollutants from oil and gas activities, but when solar 
irradiance and absolute humidity, which are both required to initi- 
ate conventional photochemistry essential for ozone production, are 
at a minimum. Here, using data from a remote location in the oil 
and gas basin of northeastern Utah and a box model, we provide a 
quantitative assessment of the photochemistry that leads to these 
extreme winter ozone pollution events, and identify key factors that 
control ozone production in this unique environment. We find that 
ozone production occurs at lower NOx and much larger VOC concentrations
than does its summertime urban counterpart, leading
to carbonyl (oxygenated VOCs with a C=O moiety) photolysis as a
dominant oxidant source. Extreme VOC concentrations optimize
the ozone production efficiency of NOx. There is considerable potential
for global growth in oil and gas extraction from shale. This analysis
could help inform strategies to monitor and mitigate air quality
impacts and provide broader insight into the response of winter ozone 
to primary pollutants. 

One of the key scientific challenges in understanding winter ozone 
(O3) is determining the source of the radicals (gas-phase molecules with 
an unpaired electron that react rapidly with VOCs) required to initiate 
and sustain oxidation cycles. Quantifying these sources is essential for 
understanding the individual roles of NOx and VOCs during these O3
pollution episodes and for the design of mitigation strategies”. By far
the largest radical source in the lower atmosphere is the photolysis of
O3 itself, which produces a small yield of electronically excited oxygen
atoms, O(1D), some of which react with water vapour to produce hydroxyl
(OH) radicals'*. During mid-latitude winter, both ultraviolet light and,
especially, water vapour are far less abundant than in summer, leading
to a 15- to 60-fold decrease in primary OH production through this
mechanism'®’”. The seasonal cycle in mid-latitude OH production is
responsible for the summertime maxima in urban O3 but presents a
conundrum for understanding winter O3 events (Fig. 1).

The Uintah Basin Winter Ozone Studies (UBWOS) were a set of field 
intensives (large sets of air and radiation measurements occurring for 
a limited duration, typically weeks to months) at a remote location 
(40.1437° N, 109.4680° W) within the oil and gas basin of northeastern 
Utah (Fig. 1) during January and February of 2012, 2013 and 2014, moti- 
vated by observations of high O3 in two preceding years. Winter O3 is 

Figure 1 | Seasonal cycle of O3 in the Uintah Basin, Utah and the Los
Angeles Basin, California in 2013. a, Digital elevation map (elevation
indicated by colour scale) of the Uintah Basin showing oil and gas wells
(grey dots), O3 monitors (red circles), urban centres (yellow squares) and the
site of the field intensives (Horsepool, blue diamond). b, Graphs at left show
daily maximum 8 h-average O3 for 2013 at Ouray, Utah, a remote site in the
Uintah Basin (population 50,000), and Riverside, California, an urban receptor
site in the eastern Los Angeles Basin, a region with 18 million residents. Graphs
at right show data sorted by increasing O3 mixing ratio, together with the
number of days in excess of the US national ambient air quality standard
(75 p.p.b.v., 8 h average; black dashed line). In 2013, O3 exceedances were more
frequent and greater in severity at Ouray than at Riverside, despite the large
difference in population.

clearly related to oil and gas emissions; although inventories remain
uncertain, the US Environmental Protection Agency estimates that oil
and gas activities are responsible for 62% of NOx emissions and 97% of
VOC emissions in the two counties that comprise the Utah side of the
basin. The winter of 2011-2012 was warm, with no snow cover, only
moderate (16 p.p.b. d⁻¹ average) O3 production and no O3 in excess of
51 p.p.b. by volume’® (p.p.b.v.). Multiple strong O3 events occurred
during the colder and consistently snow-covered winter of 2012-2013, with
threefold-greater daily average O3 production than during the previous
year. Meteorologically stagnant conditions that concentrate emissions
in a shallow boundary layer have been a prerequisite of winter O3 events
observed thus far. These conditions are amenable to treatment with a
box model, in which the relevant chemical reactions are simulated in a
zero-dimensional ‘box’; emissions of primary pollutants into the box
are included, and transport and dry deposition processes are represented
through a first-order loss term. Further details of the model,
which uses an updated Master Chemical Mechanism v3.2 chemistry
scheme’’ containing more than 10,000 reactions, are in Methods. The
near-explicit model of radical sources, propagation and amplification
allows a powerful analysis of the factors that govern winter O3 production
and that differentiate it from its summer, urban counterpart.
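
The box-model structure described above (emission into a well-mixed box, with transport and deposition folded into a first-order loss) can be sketched for a single chemically inert tracer. This is an illustration only, not the model used in the study: the chemistry is omitted entirely, and the emission rate, loss rate, time step and function name are hypothetical values chosen for the example.

```python
def box_model_tracer(emission, k_loss, c0, dt, n_steps):
    """Integrate dC/dt = emission - k_loss * C with forward Euler steps.

    emission: source term (mixing ratio per unit time).
    k_loss:   first-order physical loss rate (per unit time).
    c0:       initial mixing ratio.
    Returns the list of mixing ratios at each step, starting with c0.
    """
    c = c0
    history = [c]
    for _ in range(n_steps):
        c = c + dt * (emission - k_loss * c)
        history.append(c)
    return history


# Hypothetical numbers: 2 p.p.b.v. per hour emitted, loss rate 0.1 per hour,
# integrated for 200 hours with a 0.1 hour time step.
trace = box_model_tracer(emission=2.0, k_loss=0.1, c0=0.0, dt=0.1, n_steps=2000)
steady_state = 2.0 / 0.1  # analytic limit: emission / k_loss
print(f"final: {trace[-1]:.3f}, steady state: {steady_state:.3f}")
```

With no chemistry the tracer relaxes to emission divided by loss rate; a smaller loss rate, as under stagnant conditions, raises that limit, which is the sense in which stagnation concentrates emissions.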

Figure 2 shows a single, stagnant, 6 d period (31 January to 5
February 2013) during which daily-mean O3 mixing ratios increased
from 54 to 95 p.p.b.v. and the daily maximum 8 h-average O3 increased
from 67 to 107 p.p.b.v. This event was the longest sustained build-up
of O3, although even higher mixing ratios were observed during the
2013 study (Fig. 1). Throughout this period, the model reproduces the
observed build-up and diurnal cycle of O3, with a mean 10 min-average
model-to-measurement discrepancy of +4%. The model also accurately
reproduces observed concentrations of the key oxidized reactive nitrogen
(for example peroxyacetyl nitrate, with an average model deviation
of +1%) and oxygenated VOCs (for example acetaldehyde, -2%) over
the 6 d simulation, providing additional confidence in the simulation
of VOC-NOx photochemistry and O3 production. Further details on
model performance are in Methods.

The detailed chemical mechanism enables the identification of the
radical sources that drive O3 production. The pie charts on the right
side of Fig. 2 and Extended Data Table 3 show the integrated radical
sources on the final day of the simulation. Primary OH production
through O(1D) + H2O was small (0.74 p.p.b.v. d⁻¹, or 4% of the total),
as is expected for this winter environment. By comparison, this source
is approximately 10 p.p.b.v. d⁻¹ in the Los Angeles basin in summer”,
but was only 0.17 p.p.b.v. d⁻¹ during UBWOS 2012, when O3 levels were
much lower’®. The reaction of O3 with unsaturated hydrocarbons (alkenes)
is an OH source that can be large during periods of high urban
O3 (ref. 20), but contributes only 0.34 p.p.b.v. d⁻¹ (1.8% of the total) here
owing to the low emissions of alkenes relative to alkanes and aromatics”.
Photolysis of nitryl chloride, ClNO2, which arises from the night-time
heterogeneous reactions of nitrogen oxides”, was also small, probably
as a result of a lack of aerosol-phase chloride. Nitrous acid (HNO2) also
forms from heterogeneous reactions of nitrogen oxides, and photolyses
readily to produce OH radicals. The sources and atmospheric chemistry
of HNO2 have been the subject of intense recent interest (see, for example,
ref. 22), including during the winter of 2011 in Wyoming”. The photolysis
of HNO2 was the least certain free-radical source because of the
difficulty in measuring it reliably. The HNO2 contribution to UBWOS
2013 in Fig. 2 is an estimate based on measurements in 2012 (Methods).
The shaded region in the plot of O3 in Fig. 2 illustrates the effect of HNO2
on the calculated O3, with the lower bound being a simulation with zero
HNO2 and the upper bound a simulation with a twofold increase in
HNO2. This radical source is not required to simulate O3 build-up events
accurately, although it may have a significant role during the initiation
stages, when other radical sources are smaller.

By far the dominant radical source (85%) in the simulation on day six 
is the photolysis of carbonyl compounds. The model under-predicted 
formaldehyde, the simplest carbonyl, by 30%, and an additional constant 
formaldehyde source was added to achieve agreement with observations. 
This source could arise from a direct emission or incomplete model chem- 
istry. Removal of this source results in 6%-lower peak O3 on day six. 
Even with additional formaldehyde added to the model, the majority 
of the radical production on day six (9.3 p.p.b.v. d⁻¹ and 50.5%) is due to
the photolysis of larger carbonyl compounds (keto-aldehydes, glyoxal +
methyl glyoxal and mono-aldehydes, comprising 13.5, 11.2 and 9.3%, respectively).
Although carbonyl compounds are products of VOC oxidation,

Figure 2 | Observed and modelled photochemistry at Horsepool, Utah.
a, Observed ozone (red) during a multi-day meteorological stagnation event.
Overlaid on the observations is a chemical box model simulation (blue)
employing the Master Chemical Mechanism scheme. The solid line is the base
simulation (see main text and Methods), and the shaded region is the range
of simulation without photolysis of nitrous acid (HNO2) or with twice the
base-case HNO2 photolysis as a radical source. b, Comparison between model
(blue) and measurement (red) for peroxyacetyl nitrate (PAN; left) and
acetaldehyde (CH3CHO; right), which are the photochemical oxidation
products of reactive nitrogen and VOCs, respectively. c, Contributions of
different free radical sources on day six of the simulation for total sources (top)
and primary sources only (bottom), sized according to their relative
magnitudes. The lower pie chart illustrates only the primary radical production
(O3 photolysis, O3 alkene reactions and photolysis of radical precursors
produced in heterogeneous nitrogen oxide reactions). If the additional 30%
H2CO added to the model arises from an emission source, it would represent a
primary radical source and would be included in the lower pie chart. These
sources are presumably responsible for initiation of the early stages of O3
production. O3 photolysis and reaction with alkenes, which are shown for day
six of the simulation, scale with O3 itself and are therefore smaller contributors
at the onset of O3 build-up events.

they are net radical sources that function as radical amplifiers because
they are chemically stable products formed during radical propagation 
chains (that is, reactions that consume and produce one radical). Further 
details on this process are in Methods. 
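
The percentage shares quoted for the individual radical sources can be reproduced from the stated rates. The sketch below is a consistency check under one assumption: that the shares are taken relative to the total net radical production of 18.5 p.p.b.v. d⁻¹ that the text gives for the same simulation. The dictionary keys and function name are illustrative.

```python
# Assumed denominator: total net radical production on day six, p.p.b.v. per day.
TOTAL_RADICAL_PRODUCTION = 18.5

# Day-six source strengths quoted in the text, p.p.b.v. per day.
sources = {
    "O(1D) + H2O": 0.74,
    "O3 + alkenes": 0.34,
    "larger carbonyl photolysis": 9.3,
}


def percent_of_total(rate, total=TOTAL_RADICAL_PRODUCTION):
    """Share of the total radical production contributed by one source, in %."""
    return 100.0 * rate / total


shares = {name: percent_of_total(rate) for name, rate in sources.items()}
for name, share in shares.items():
    print(f"{name}: {share:.1f}%")
```

The results agree with the quoted 4%, 1.8% and roughly 50% to within the rounding of the published rates.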

As noted above, no high-O3 events occurred during the 2012 study,
raising the question of the role of snow cover in driving O3 production.
The principal effects of snow cover are to increase surface albedo and,
thus, actinic flux for photolysis reactions, and to reduce the mixed-layer
height, concentrating primary emissions. Simulations with reduced
albedo (from the observed 0.85 to 0.1) but the same emissions and
physical loss of NOx and VOC result in a 33% decrease in peak O3 on
day six. A simulation with VOC mixing ratios equivalent to those of
the 2012 study year (that is, assuming the same high VOC emissions
between the two years, but with greater dilution in 2012), but with an
albedo of 0.85, results in a 45% reduction in peak O3 on day six'®. Thus,
both high VOC emissions into a shallow, stable boundary layer, and
increased photolysis rates due to the snow albedo, are required for
rapid winter O3 production.

A key question in the design of O3 mitigation strategies is the relative
effectiveness of emissions reductions in VOC precursors versus NOx
precursors“. Analysis of the lower-O3 year during UBWOS 2012 showed
it to be radical limited’®, because the rate of radical production was small
compared with the rate of emission of NOx (ref. 23), which determines
the rate of radical removal. Radical limitation normally leads to NOx
saturation (that is, increased NOx leads to decreased O3), which is typical
of most urban areas in winter, where O3 is generally well below air quality
standards. Figure 3 shows the contours of an O3 isopleth diagram
for the build-up event in Fig. 2, that is, peak O3 on day six of the simulation
as a function of NOx and VOC emissions, normalized to unity
for the base-case simulation of Fig. 2. The total net radical production
of 18.5 p.p.b.v. d⁻¹ is sufficient to prevent NOx saturation. As the rightmost
graphs in Fig. 3 show, NOx is near its peak efficiency for O3 production,
and the response of O3 to VOC emissions is just beyond the
transition from VOC sensitive to VOC saturated. These results contrast
with an earlier model study of the Upper Green River Basin, Wyoming,
in which simulations of three of four events were NOx saturated and
VOC sensitive according to analysis of single-day events using a lumped
VOC degradation scheme’.

The isopleth diagram in Fig. 3 appears similar to that for summer
urban O3 (ref. 14). However, the graphs on the left-hand side of the
figure, which compare NOx, VOC mixing ratios and OH reactivity
(see below) in the Uintah Basin in winter to the Los Angeles Basin in
early summer (May-June), illustrate significant differences. The Los
Angeles measurements are from the CalNex field intensive (May-June
2010, Pasadena, California), during which the maximum 8 h-average
O3 reached 84 p.p.b.v. The distributions of NOx and VOCs in these
environments are quite different, with much (nearly fourfold) lower
NOx, but much (~15 times) greater VOC in the Uintah Basin. The composition
of VOCs from urban emissions differs considerably from that
of VOCs from oil- and gas-producing regions’, with the latter dominated
by compounds (that is, light alkanes) that are less reactive to
OH radicals and, thus, less effective at producing O3 (ref. 24). Even
accounting for this difference, the median calculated OH reactivity
(that is, the sum of all VOC + OH rate coefficients multiplied by the
VOC concentration) is 4.6 times greater in the Uintah Basin than in
Los Angeles. (We note that methane, although abundant in the Uintah
Basin”’, accounts for less than 1.5% of the calculated OH reactivity.)
Thus, although similar in appearance to that of an urban area, the O3
isopleth in Fig. 3 occupies a very different NOx-VOC space. Were NOx
in the wintertime Uintah Basin more comparable to that of urban
settings historically, the O3 photochemistry would be fully NOx saturated
and O3 production would thus be significantly less efficient.
Conversely, if the VOC mixing ratios and OH reactivity in the wintertime
Uintah Basin were more typical of an urban area, the photochemistry
would not give rise to strong O3 events. It is the exceedingly high
VOC concentrations, and the radicals produced during their oxidation,
in the oil and gas region that lead to highly efficient O3 production.
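
The OH reactivity defined in the paragraph above is a sum over species of each VOC + OH rate coefficient multiplied by that VOC's concentration. A minimal sketch of that sum follows; the species labels, rate coefficients and concentrations are placeholder numbers chosen for easy arithmetic, not measured values from either basin, and the function name is hypothetical.

```python
def oh_reactivity(species):
    """Total OH reactivity in s^-1.

    species maps a label to (k_OH, concentration), with k_OH in
    cm^3 molecule^-1 s^-1 and concentration in molecules cm^-3.
    """
    return sum(k_oh * conc for k_oh, conc in species.values())


# Placeholder inputs for illustration only (not observations).
example_mix = {
    "fast-reacting VOC": (1.0e-12, 2.0e11),
    "slow-reacting VOC": (2.5e-13, 4.0e11),
}
reactivity = oh_reactivity(example_mix)
print(f"OH reactivity: {reactivity:.2f} s^-1")
```

The example shows why a mixture dominated by slowly reacting compounds can still have a large total reactivity: a low rate coefficient is offset by a high enough concentration.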

Although observed extreme winter O3 events in the United States have
been limited to meteorologically stagnant conditions in mountain basins, 
similar phenomena may occur in regions with oil and gas development 
without routine air quality monitoring'’. Present emissions trends in the 
United States are towards lower NOx from urban and power generation
sources”®”’, and increasing methane and VOCs from fossil fuel develop- 
ment’. Urban areas in close proximity to oil- and gas-producing regions 
may tend towards more efficient O3 production during the winter sea- 
son, with as yet unrecognized consequences. Shale gas development in 
other mid- or high-latitude regions that may experience stable winter 
meteorology, such as continental Europe”’, the United Kingdom” and 
China” lags that of the United States but holds the same potential for 
rapid exploitation*’. The measurement and model framework outlined 
here will serve to better define emerging air quality issues associated 
with global development of new fossil fuel resources. 

Figure 3 | Isopleth diagram for winter O3 production. a, Isopleth with
contours showing peak O3 on day six of the simulation in Fig. 2 as a function of
the emissions of NOx and VOC in the model relative to the base case (red
symbol). The dashed line indicates the 75 p.p.b.v. US national ambient air
quality standard (NAAQS). b, Response of peak O3 to NOx (top) and VOC
(bottom), expressed in mixing ratio units. These plots are slices through the
surface at the red horizontal and vertical lines in the isopleth diagram, with the
base case shown as the red symbol. c, Comparison of distributions of NOx (top),
VOC (middle, expressed as parts per million carbon) and calculated OH
reactivity with respect to VOC (bottom; see text) for the Uintah Basin, Utah in
January-February 2013 (blue bars) and for measurements in May-June 2010 in
Pasadena, California in the Los Angeles Basin (red bars).

METHODS


Model chemistry scheme and constraints. The meteorologically stagnant conditions
associated with winter O3 events are amenable to treatment with a box model,
in which the relevant chemical reactions are simulated in a zero-dimensional ‘box’,
which includes emissions of primary pollutants into the box and the representation
of both the transport of species out of the box and dry deposition processes
through a first-order physical loss term. This approach has the advantage of allowing
detailed treatment of the VOC oxidation chemistry, at the expense of a comprehensive
representation of dynamical processes. The approach has been applied
previously’ to simulate winter O3 events in the Upper Green River Basin (UGRB)
of Wyoming. Our model builds upon this prior analysis in a number of respects.
First, our simulation uses an explicit, rather than lumped, VOC degradation scheme.
The explicit chemistry accurately simulates radical sources derived from carbonyl
photolysis, key to understanding the VOC and NOx sensitivities of O3 production.
Second, we simulate a multi-day build-up event rather than single-day ozone
events. Third, our simulations use continuous emissions of VOC and NOx, tuned
to match observed levels, while the UGRB simulations used an initial VOC and NOx
concentration for each day, which was subsequently allowed to oxidize away. Fourth,
our simulations use an explicit scheme for the diurnal variation in boundary-layer
height (expressed as a dilution term in the box model) matched to LIDAR observations
and to the diurnal variation in long-lived species, such as methane. The
UGRB simulations used either a fixed boundary layer, or a linear growth in boundary
layer. Fifth, our simulations are constrained to realistic diurnal variations of
nitrous acid, HNO2, based on observations, rather than expressing HNO2 as a fixed
ratio to NO2. Sixth, our simulations appear to use a more realistic range of NOx
(<10 p.p.b.v. range for the diurnal average, rather than an initial concentration exceeding
100 p.p.b.v.). Seventh, our simulation is tested against observations of photochemical
products other than ozone, such as peroxyacetyl nitrates and oxygenated
VOCs, to lend confidence to the result. Lastly, our simulations explicitly calculate
radical sources and their magnitude, and use this information to define the VOC
and NOx sensitivity.

Model simulations were performed using the Dynamically Simple Model of 
Atmospheric Chemical Complexity? ** (DSMACC). The model chemistry scheme 
is generated by the Master Chemical Mechanism (MCM) v3.2’*** and contains 
detailed inorganic chemistry and a near-explicit degradation scheme for 32 of the 
observed VOCs and OVOCs (Extended Data Table 1), resulting in 2,754 species 
and 10,675 reactions. The MCM v3.2 chemistry scheme has been updated to include 
temperature-dependent yields for organic nitrates based on ref. 36. This change 
to the mechanism was important because temperatures during winter ozone events 
are significantly lower than during more typical, summertime urban ozone forma- 
tion (for example, —8.5 °C was the average during the 2013 measurements in the 
Uintah Basin, Utah). 

The UBWOS field intensives made an extensive set of meteorological and chem- 
ical observations with which to constrain the model, including more than 60 speciated
VOCs, speciated reactive nitrogen (NOx and its major oxidation products)
and photochemical radical precursors (Extended Data Table 1). Modelling of 
UBWOS 2012, during which there was no snow on the ground and ozone did not 
exceed the US national ambient air quality standards, has already been reported’®. 
During UBWOS 2012, the deployed instrumentation provided a more extensive 
suite of observed VOCs than was available during 2013 (see Methods section on 
chemical and radiation measurements). Thus, to maximize the model constraints 
for UBWOS 2013, several of the UBWOS 2012 observations have been used to 
inform the 2013 model study. Aromatic VOCs in 2013 were measured by PTR-MS,
providing a sum of all aromatic species at individual carbon numbers (that is, one
summed ΣCn signal per carbon number n). The use of a GC-MS co-located with the
PTR-MS in 2012 allowed the speciation of these aromatic classes. Although no 
GC-MS was present during UBWOS 2013, the PTR-MS aromatic classes have 
been allocated to specific compounds on the basis of the 2012 speciation ratios. 
This approach rests on the assumption that relative emissions from the oil and 
gas production source did not change between the two years. A similar approach 
was also used for the substituted cyclo-alkane species that were measured during 
UBWOS 2012 but not 2013. During UBWOS 2012, all the cyclo-alkanes had a 
similar daily profile, indicating a common source. Although many of these com- 
pounds were not measured during UBWOS 2013, their concentrations have been 
estimated on the basis of their observed ratio to cyclohexane in 2012. As the MCM 
does not contain explicit oxidation schemes for the seven substituted cyclo-alkanes 
whose concentrations have been estimated, these compounds have been lumped 
as cyclohexane for the simulations described here. The MCM chemistry scheme has 
also been modified to include the photolysis of ClNO2 to yield a chlorine radical.
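The allocation of a summed PTR-MS class to individual compounds, as described above, can be sketched in a few lines. This is a minimal illustration and not the authors' code: the function name, the compound labels and the numerical ratios are hypothetical placeholders, and the only assumption carried over from the text is that the 2012 speciation ratios still apply in 2013.

```python
def allocate_class_sum(class_sum_ppbv, speciation_ratios):
    """Split a summed mixing ratio across compounds in proportion to given ratios."""
    total = sum(speciation_ratios.values())
    if total <= 0:
        raise ValueError("speciation ratios must sum to a positive value")
    # Normalise so the allocated compounds always add back up to the class sum.
    return {name: class_sum_ppbv * ratio / total
            for name, ratio in speciation_ratios.items()}


# Hypothetical 2012 speciation of one carbon-number class (illustrative values only).
ratios_2012 = {"compound_a": 0.5, "compound_b": 0.3, "compound_c": 0.2}

# Hypothetical 2013 summed PTR-MS signal for the same class, in p.p.b.v.
allocated_2013 = allocate_class_sum(2.0, ratios_2012)
```

The same proportional scaling would apply to the substituted cyclo-alkanes, with the observed 2012 ratio to cyclohexane taking the place of the speciation ratios.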

All primary VOC species in the model are introduced via a constant emission 
over the entire six-day period, tuned to best match the observed concentrations. As
the production mechanisms of HNO2 and ClNO2 involve uncertain heterogeneous
processes, and are therefore difficult to represent in this purely gas-phase mechanism,
the concentrations of these radical precursor species have been constrained to an
average diurnal concentration profile (Extended Data Fig. 1). ClNO2 concentrations
were constrained to the observed 2013 average diurnal profile, while HNO2
concentrations were constrained to the observed 2012 average diurnal profile to provide an
upper limit for its importance as a primary radical source (see Methods section
on wintertime photochemistry).

NOx within the model was constrained via a constant emission of NO. The model
chemistry scheme calculates the partitioning between all reactive nitrogen compounds,
and the emission of NO was adjusted to minimize the deviation between
the modelled and observed NOx concentrations (Extended Data Fig. 1).

Data from the 2013 Uintah Basin Winter Ozone Studies are available at
http://esrl.noaa.gov/csd/groups/csd7/measurements/2013ubwos/. National O3 data, such
as those in Fig. 1, are available from the US Environmental Protection Agency on 
request (http://www.airnowtech.org). 

Photolysis rates in the model. Direct observations of j(O¹D) and j(NO2) were
available for UBWOS 2012, but not for UBWOS 2013. Instead, a total downwelling
radiation measurement was used to calculate these photolysis frequencies by comparison
with data from 2012, when the total downwelling radiation measurement
was run alongside calibrated filter radiometers for j(O¹D) and j(NO2). Polynomial
fits of the filter radiometer data against total radiation were used to extract a
calibration of the total radiation measurement for both j(O¹D) and j(NO2).
Extended Data Fig. 2a, b shows the 2012 average diurnal j(NO2) and j(O¹D) filter
radiometer observations and the photolysis frequencies calculated using the 2012
total radiation measurement. Photolysis frequencies for UBWOS 2013 were then
calculated by applying the 2012 calibration to the 2013 total radiation observations
(Extended Data Fig. 2a, b). The model uses the TUV radiation model to calculate
photolysis frequencies. TUV photolysis frequencies were calculated using the average
observed surface albedo of 0.85 and an average observed O3 column density of
323 Dobson units (average OMI data during the observation period). The TUV-calculated
photolysis frequencies were then scaled to the average j(O¹D) and j(NO2)
determined as described above, with the ratio between the measured and calculated
j(NO2) being applied to all calculated photolysis rates other than j(O¹D).
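The final scaling step described above can be written compactly. The sketch below is illustrative only: the function name, the dictionary keys and every numerical value are assumptions made for the example, not quantities from the study. It applies the measured-to-calculated j(O¹D) ratio to j(O¹D) and the measured-to-calculated j(NO2) ratio to every other photolysis rate.

```python
def scale_photolysis(tuv_rates, j_no2_measured, j_o1d_measured):
    """Scale calculated photolysis rates (s^-1) to measurement-derived values.

    tuv_rates must contain the keys "j_NO2" and "j_O1D"; any other keys are
    treated as additional photolysis channels.
    """
    no2_factor = j_no2_measured / tuv_rates["j_NO2"]
    o1d_factor = j_o1d_measured / tuv_rates["j_O1D"]
    scaled = {}
    for name, value in tuv_rates.items():
        # j(O1D) uses its own ratio; everything else follows the j(NO2) ratio.
        factor = o1d_factor if name == "j_O1D" else no2_factor
        scaled[name] = value * factor
    return scaled


# Hypothetical calculated rates and measurement-derived averages (s^-1).
tuv = {"j_NO2": 8.0e-3, "j_O1D": 1.0e-5, "j_HCHO": 3.0e-5}
scaled = scale_photolysis(tuv, j_no2_measured=1.0e-2, j_o1d_measured=8.0e-6)
```

Using the j(NO2) ratio for all remaining channels assumes that they respond to albedo and cloud in the same way as j(NO2), which is the simplification the text describes.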

Physical loss within the model. Observed concentrations of long-lived species,
such as methane, did not continue to rise during the entirety of the six-day stag- 
nation event, but instead accumulated during the first days of the event (approxi- 
mately four days for methane; Extended Data Fig. 3a) and then levelled off. As the 
lifetime of methane with respect to chemical loss is of the order of years, this estab- 
lishment of a steady-state concentration indicates losses due to physical mixing 
out of the shallow inversion layer. In addition to the multi-day trend in methane, 
the observed concentrations also show a diurnal pattern, with increasing concen- 
trations until late morning and then a decrease until late afternoon. This diurnal 
pattern indicates increased mixing and dilution of species in the afternoon. This 
is consistent with LIDAR and tether sonde observations of boundary-layer height,
which showed a growth in mixing height from 5–25 m during the night to 130–160 m
between 12:00 and 16:00 local time, owing to turbulent convective mixing,
before reducing again as the sun set. 

A first-order loss parameter was used to represent all non-chemical loss of species 
through mixing, and to a lesser extent deposition, in order to prevent the accu- 
mulation of unconstrained species within the model. To represent turbulent con- 
vective mixing in this zero-dimensional model framework, a bi-modal, first-order 
physical loss parameter was used, with one physical loss rate used for the night 
and morning and a second, greater, loss rate used during the afternoon (Extended 
Data Fig. 3b). This parameter was adjusted to best reproduce the methane obser- 
vations, and then applied to all other species within the model, after correcting for 
methane mixing into a non-zero (1.8 p.p.m.v.) background. The same dilution term 
was applied to ozone after accounting for mixing into a background of 50 p.p.b.v. 
(estimated using ozone LIDAR observations). 
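A zero-dimensional treatment of this kind can be sketched as a single ordinary differential equation, dC/dt = E − k(t)(C − C_bg), with a loss rate k(t) that switches between two values. The code below is a minimal illustration under stated assumptions: the two loss rates, the afternoon window and the emission rate are hypothetical, the explicit Euler scheme is chosen for brevity, and only the 1.8 p.p.m.v. methane background and the 600 s step size echo values mentioned in the text.

```python
def loss_rate(hour_of_day, k_base=1.0e-5, k_afternoon=4.0e-5,
              afternoon_start=12.0, afternoon_end=16.0):
    """Bi-modal first-order physical loss rate (s^-1); all values hypothetical."""
    if afternoon_start <= hour_of_day < afternoon_end:
        return k_afternoon
    return k_base


def integrate(c0, emission, background, days=6, dt=600.0):
    """Explicit Euler integration of dC/dt = E - k(t) * (C - C_bg)."""
    steps = int(days * 86400 / dt)
    c = c0
    series = [c]
    for i in range(steps):
        hour = ((i * dt) % 86400.0) / 3600.0
        k = loss_rate(hour)
        # Dilution acts only on the excess above the background mixing ratio.
        c += dt * (emission - k * (c - background))
        series.append(c)
    return series


# Methane-like tracer: 1.8 p.p.m.v. background, hypothetical emission in p.p.m.v. per second.
methane = integrate(c0=1.8, emission=2.0e-5, background=1.8)
no_emission = integrate(c0=1.8, emission=0.0, background=1.8)
```

With these placeholder numbers the tracer rises over the first days and then oscillates below the ceiling C_bg + E/k_base, qualitatively like the levelling-off described for methane.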

LIDAR and tether sonde observations do show some day-to-day variability in 
the timing and magnitude of the change in boundary-layer depth, and, thus, in 
dilution of the surface layer. The simple approach to representing mixing is invari- 
ant from one day to the next, and thus will not capture day-to-day changes in ozone 
that are due to physical losses. The simulation is intended to accurately simulate 
the average daily ozone response and, thus, the sensitivity to average emissions of 
VOCs and NOx. Simulations where the physical loss parameter was increased or
decreased by a factor of two showed changes in maximum day-six O3 mixing ratios
of −43% and +36%, respectively. These changes are due to a combination of changes
in O3 precursor concentrations within the model and the change in the rate
of physical removal of O3 itself. In an attempt to isolate these effects, simulations
where the physical loss rate of O3 was kept constant and that for all other species
was increased or decreased by a factor of two gave changes of −20% and +8%,
respectively. As the relative emissions of NOx and VOCs do not change across these
simulations, it is interesting to note that the calculated daily O3 increase on day six
changes much less than the absolute mixing ratio does between the base
model simulation (Δ[O3] = 39 p.p.b.v.) and the simulations with the physical loss
rate for all species (including O3) doubled (Δ[O3] = 37 p.p.b.v.) and halved
(Δ[O3] = 36 p.p.b.v.).

Ozone surface uptake rates were measured during UBWOS 2013 by eddy covariance.
The 24 h median observed ozone deposition rate of 0.02 cm s⁻¹
was included in all model simulations. This value is within the range of
0–0.2 cm s⁻¹ reported in literature studies over snow. Although included for
completeness, this loss for ozone is insignificant compared with dilution and chemical losses.

LIDAR and balloon soundings from Horsepool show that the high O3 is con- 
centrated in the lowest few hundred metres above the surface and is not trans- 
ported to the site from the free troposphere or lower stratosphere. 

The model was initialized using the observed VOC, NOx and O3 concentrations
at 00:00, 31 January 2013, and calculated oxidation product concentrations after a
24 h spin-up period from this time. The model was integrated forwards, with an
output time step of 600 s, for a period of six days.

Model performance. In addition to reproducing the observed build-up of ozone 
over the six-day period, the model also reproduced the build-up of other photo- 
chemical oxidation products. Extended Data Fig. 4 shows the model to measurement 
comparisons for (a) acetaldehyde, (b) acetone, (c) 2-butanone (MEK), (d) acetyl 
peroxynitrate (PAN) and (e) propionyl peroxynitrate (PPN). Considering the 
simple dynamical representation and the calculated photolysis frequencies, the 
agreement between the calculated and observed oxidation products over the six-day
model run is excellent (with average model deviations of −2% acetaldehyde, +44% MEK,
+1% PAN and −16% PPN). This agreement, calculated as the peak of a Gaussian
fit to the probability distribution of the 10 min-averaged model-to-measurement 
deviation (Extended Data Fig. 4), gives confidence in the model’s ability to repro- 
duce the observed ozone production photochemistry. 
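The deviation statistic described above can be approximated in a few lines. The sketch below is not the authors' procedure: instead of a least-squares Gaussian fit to a binned histogram, it estimates a normal distribution directly from the sample mean and standard deviation of the percentage deviations, whose peak then sits at the sample mean. The function names and the example data are hypothetical.

```python
from statistics import NormalDist


def percent_deviation(model, observed):
    """Percentage deviation of each model point from its paired observation."""
    return [100.0 * (m - o) / o for m, o in zip(model, observed) if o != 0]


def gaussian_peak(deviations):
    """Centre and width of a normal distribution estimated from the samples."""
    fit = NormalDist.from_samples(deviations)
    return fit.mean, fit.stdev


# Hypothetical 10 min-averaged model and observation pairs (p.p.b.v.).
model_values = [11.0, 9.0, 10.5, 9.5]
observed_values = [10.0, 10.0, 10.0, 10.0]

peak, width = gaussian_peak(percent_deviation(model_values, observed_values))
```

For strongly skewed deviation distributions the two approaches can give different peaks, so a histogram-based fit would be the closer match to the method named in the text.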

Chemical and radiation measurements during UBWOS 2013. Data used as 
model inputs have been described in the preceding sections. Extended Data Table 2 
summarizes the chemical and physical measurements at the Horsepool site dur- 
ing January and February 2013 that were used for the model analysis. For a dis- 
cussion of the UBWOS 2012 measurements, we refer the reader to ref. 16. 

Briefly, nitrogen oxides (NOx = NO + NO2), ozone (O3) and total reactive nitrogen
(NOy) were measured by cavity ring-down spectroscopy (CRDS) at 405 nm,
which detects NO2 directly and other species simultaneously via conversion to NO2
(refs 39, 40). Similarly, night-time nitrogen oxides were measured by CRDS at
662 nm, detecting NO3 directly and N2O5 via thermal conversion to NO3 (ref. 41).
Nitryl chloride (ClNO2) was measured, together with speciated PANs, using chemical
ionization mass spectrometry (CIMS) with iodide (I−) as a reagent ion.
Potential interference due to peroxyacids was checked and found to contribute
<2% to measured concentrations. Inorganic acids, including HCl, HNO2 and
HNO3, were measured using negative-ion proton transfer reaction (PTR) CIMS
with acetate (CH3CO2−) reagent ion. Oxygenated and aromatic VOCs were
measured by PTR-MS and PTR-Tof-MS (Tof, time of flight) using H3O+
primary ions. Speciated VOCs, including alkanes, alkenes, cycloalkanes, aromatics
and oxygenates were measured by GC-MS in 2012, and by GC-FID (FID, flame
ionization detection) in 2013. Formaldehyde was measured by differential optical
absorption spectroscopy (DOAS) in 2012 and with PTR-MS in 2012 and 2013
using the method in ref. 49. Methane was measured using a commercial cavity
ring-down instrument. Downwelling radiation was measured using a spectral
pyranometer as described above.

Construction of the ozone isopleth (Fig. 3). For this work to have the maximum
possible relevance for policy makers, the O3 isopleth diagram was constructed in
emissions space. For this purpose, 107 model simulations were carried out across a
landscape from VOC and NOx emissions at 10% of those required to reproduce
the observations in the base model to a doubling of emissions. VOC emissions were
treated as a whole, with the changes applied to the emissions of all VOC species
for each simulation. A contour plot was then constructed to map out the impact
of these changes in NOx and VOC emissions on the maximum calculated O3 mixing
ratio on day six of the simulation. Owing to the nonlinear nature of the chemistry,
and, hence, of the impact of changing emissions on absolute concentration, Fig. 3b
was included to illustrate these sensitivities in VOC and NOx concentration space.
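The bookkeeping behind such an isopleth surface can be sketched as a loop over emission scale factors. Everything in the example is an assumption made for illustration: the regular grid (the text does not say how the 107 simulations were distributed), the function names, and in particular `toy_model`, which is an arbitrary smooth placeholder standing in for a full box-model run and carries no chemical meaning.

```python
def emission_grid(low=0.1, high=2.0, n=10):
    """Evenly spaced emission scale factors from 10% to 200% of the base case."""
    step = (high - low) / (n - 1)
    return [low + i * step for i in range(n)]


def isopleth_surface(run_model, voc_scales, nox_scales):
    """Peak O3 for every (VOC, NOx) scale pair; rows follow nox_scales."""
    return [[run_model(voc, nox) for voc in voc_scales] for nox in nox_scales]


def toy_model(voc_scale, nox_scale):
    """Placeholder for a box-model run returning day-six peak O3 (not chemistry)."""
    return 100.0 * voc_scale * nox_scale / ((0.5 + voc_scale) * (0.5 + nox_scale))


scales = emission_grid(n=5)
surface = isopleth_surface(toy_model, scales, scales)
```

The resulting two-dimensional list can be passed to any contouring routine, with the VOC and NOx scale factors as the axes.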

Description of the Uintah Basin. Figure 1 shows O3 at Ouray, Utah during 2013,
and Fig. 2 shows O3 during a six-day period in January–February at the Horsepool
intensive field site. Figure 1 shows a digital elevation map of the Uintah Basin
indicating the location of the Ouray and Horsepool sites and the distribution of the
~11,000 oil and gas wells. Also shown are O3 monitors (not for regulatory purposes)
deployed during the 2010–2011 winter. All fourteen monitors within the
basin itself (excluding two at the far west and southwest edges of the map domain)
recorded at least one, and as many as 25, days above the 75 p.p.b.v. 8 h-average US
national ambient air quality standard for O3 during this winter. Thus, these events
are widespread, impacting the largest population centres (Vernal, Utah and Rangely,
Colorado) in the basin.


Sources of radicals for wintertime photochemistry. Figure 2 shows the relative 
contributions of different radical sources to the model simulation on day six. Ex- 
tended Data Table 3 shows the absolute and relative contributions of these differ- 
ent radical sources. 

Primary radicals versus radical amplification reactions. Understanding the
source of radicals that drive tropospheric oxidation is central to understanding
tropospheric O3 photochemistry. Throughout the troposphere, the major primary
radical source is the photolysis of O3 and the subsequent reaction of O(¹D) with
water vapour to yield 2 × OH (ref. 15), a process that generates radicals from stable
precursors without radical loss. In polluted environments, numerous other radical
sources, often derived from heterogeneous reactions of NOx or from intermediates
in the VOC degradation process, may also contribute. During UBWOS 2013,
the net primary radical source was small (2.8 p.p.b.v. d⁻¹; Fig. 2); however, the number
of radicals produced by these primary sources was greatly amplified through
the production of photolabile oxygenated VOCs (OVOCs), particularly carbonyl
compounds, which photolyse to produce additional radicals. The simplest example
of this process is the high-NOx oxidation of methane (reactions SR1–SR5, shown
in equations (1)–(5)), in which an OH attacks methane in SR1 to produce a peroxy
radical. This radical is propagated through SR2, in the formation of an alkoxy
radical, and SR3, in the formation of a hydro-peroxy radical, to reform OH in SR5.
In addition to a hydro-peroxy radical, SR3 also produces formaldehyde, a chemically
stable product of the radical propagation chain that photolyses readily (for
example, during UBWOS 2013 the HCHO lifetime with respect to photolysis was
approximately 4 h between the hours of 10:00 and 15:00 local time). Its photolysis
produces two hydro-peroxy radicals in its dominant photolysis channel, whose
yield is approximately 70% (ref. 53). Formation of two radicals in addition to the
radical that is propagated through this mechanism amplifies the primary radical
source:


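\begin{align}
\mathrm{OH + CH_4 + O_2} &\rightarrow \mathrm{CH_3O_2^{\bullet} + H_2O} \tag{1}\\
\mathrm{CH_3O_2^{\bullet} + NO} &\rightarrow \mathrm{CH_3O^{\bullet} + NO_2} \tag{2}\\
\mathrm{CH_3O^{\bullet} + O_2} &\rightarrow \mathrm{HCHO + HO_2} \tag{3}\\
0.7 \times (\mathrm{HCHO} + h\nu + 2\,\mathrm{O_2} &\rightarrow \mathrm{HO_2 + HO_2 + CO}) \tag{4}\\
2 \times (\mathrm{HO_2 + NO} &\rightarrow \mathrm{OH + NO_2}) \tag{5}
\end{align}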
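The radical bookkeeping implied by reactions (1)–(5) can be checked with simple arithmetic. The sketch below is an illustration under one stated assumption, namely that every formaldehyde molecule formed in reaction (3) is lost by photolysis rather than by reaction with OH; the function and variable names are hypothetical.

```python
def radicals_from_one_oh(radical_channel_yield=0.7, radicals_per_photolysis=2):
    """Return (propagated, newly_formed) radicals per OH that attacks methane."""
    # Reactions (1)-(3) pass the original radical along as HO2.
    propagated = 1.0
    # Reaction (4): the radical channel of HCHO photolysis yields two HO2.
    newly_formed = radical_channel_yield * radicals_per_photolysis
    return propagated, newly_formed


propagated, newly_formed = radicals_from_one_oh()
total_after_one_cycle = propagated + newly_formed
```

With a 70% radical-channel yield, each OH entering the chain is returned together with roughly 1.4 additional radicals, which is the amplification the paragraph above describes.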


Extended Data Fig. 5a is a detailed version of the radical source pie chart shown in 
Fig. 2, providing a greater breakdown of the moiety of the species that constitute 
the carbonyl radical source. The high yield of photolabile oxygenated products (the 
two most significant being formaldehyde and methyl-glyoxal) in the UBWOS 2013 
calculations has the effect of rapidly amplifying the primary radical production, by
as much as a factor of 3–4 between the hours of 10:00 and 18:00. Formaldehyde is
the single largest contributor to this source. It arises from the oxidation of all the
VOCs, but, in particular, from light alkanes, such as C2H6, i-C5H12, i-C4H10 and
3-methylpentane, which were highly abundant in the Uintah Basin. Oxidation of
aromatics, in particular toluene and the xylenes, also produces numerous photolabile
carbonyl compounds, many with both a ketone and an aldehyde functional
group, including methyl-glyoxal, which is the second largest single contributor to
radical production after formaldehyde in the model. A sensitivity test, in which all
aromatic emissions were replaced with a cyclohexane emission sufficient to produce
constant OH reactivity, leads to a 7.5% reduction in peak O3 on day six of the
simulation, demonstrating the importance of radicals derived from aromatic VOCs.
The mono-aldehyde contribution to the radical source is dominated by the photo- 
lysis of acetaldehyde, propanal and isobutanal. 

The high rate of VOC oxidation by OH during UBWOS 2013 was due to the
large peroxy radical source, from these radical amplification processes, combined
with optimal NOx concentrations. During sunlight hours, an average of 89% of the
model HO2 reacts with NO to yield OH (after correcting for the instantaneous cycling
of HO2 with HO2NO2, which is a radical reservoir, not a net sink). This fraction
reaches >98% during the peak O3 production period. As every NO-to-NO2 conversion
(by reactions other than with O3) produces an O3, this efficient cycling of
radicals by the available NOx results in highly efficient O3 production. Thus, total
VOC oxidation is rapid despite moderate peak OH concentrations (~1.2 × 10⁶
molecules cm⁻³ on day six of the simulation) due to the short OH lifetime
(20–30 ms). The extremely high primary VOC concentrations in the Uintah Basin also
result in OH reactions being dominated by these species rather than reaction with
secondary oxidation products. This limits the reactive loss of the photolabile oxidation
products, such as methyl-glyoxal, thus increasing the loss of these compounds
via photolysis and increasing their dominance as the major radical source.
Three-dimensional model analyses, required to better understand the physical processes
that lead to stagnation events and describe the distribution of O3, will need to
reproduce these radical sources if they are to accurately simulate photochemical O3.

The rapid reaction of OH with VOCs and HO2 with NO results in the dominant
radical sinks being via organic peroxy radicals (Extended Data Fig. 5b). The small
contribution of OH reactions to the total radical sink is indicative of the efficient
VOC oxidation, and, thus, O3 production, by OH within the model.

Nitrous acid (HNO2). Numerous recent studies have found photolysis of HNO2
to be a significant primary source of OH radicals, particularly in polluted
(high-NOx) environments. It is also a potentially important radical source
during periods of cold or snow cover. The emission of HNO2 from the ground
or snow pack has been known for some time and has been measured as part of
snow chemistry studies in Arctic, Antarctic and mid-latitude environments.
One of these studies reports HNO2 emissions from the snow pack in Wyoming
during high-O3 episodes in 2011 (ref. 12). HNO2 observations during UBWOS 2013 were
made with a relatively new technique (acetate ion CIMS) that was suspected to
suffer from chemical interference from peroxynitric acid (HO2NO2), a species
modelled to have been present at part-per-billion levels during the cold and
photochemically active winter of 2013. Data from the CIMS were thus not used to estimate
the contribution of HNO2 to primary radical generation for the 2013 study.
Measurements of HNO2 were made during the 2012 study by both CIMS and a
long-path differential optical absorption spectroscopy (LP-DOAS) instrument. As noted
above, the 2012 study year was characterized by warmer temperatures, no snow
cover and no O3 events above the US national ambient air quality standard (ref. 16).
Interference from HO2NO2 in the CIMS instrument under these conditions would
be expected to be much smaller. The DOAS instrument was unavailable during the
2013 study, but was present in 2014, a year with snow cover but fewer O3 exceedance
events relative to 2013. Extended Data Fig. 6a shows the comparison of the
diurnally averaged HNO2 from the DOAS instrument during 2012 and 2014,
along with the CIMS measurement from 2012. The CIMS HNO2 was on average
0.032 p.p.b.v. lower than the DOAS HNO2 in 2012, although this difference is
within the 0.035 p.p.b.v. error in the DOAS HNO2 determinations. The DOAS
HNO2 was remarkably similar between 2012 and 2014, suggesting that the presence
of snow cover or active photochemistry had little influence on the mixing
ratio of HNO2. Thus, in the absence of an interference-free HNO2 measurement
during 2013, we used the daily average of the CIMS instrument from 2012 in the
base-model simulation. The integrated OH radical production from the 2012
CIMS HNO2 was 1.6 p.p.b.v. d⁻¹, compared with 2.5 and 2.6 p.p.b.v. d⁻¹ using
the 2012 and 2014 DOAS data, respectively. Figure 2 in the main text shows the
sensitivity of modelled O3 to variation of HNO2 from zero to twice the base-case
model. Because all estimates for the contribution of HNO2 are relatively modest
compared with net radical production in 2013, this range has little influence on the
model-to-measurement comparison for O3.

Inclusion in the model of HNO2 at levels comparable to those suggested in ref.
12 during winter O3 events in Wyoming produces simulated O3 levels larger than
observations by a factor of approximately two, similar to the conclusion of a
separate model analysis. Strong vertical gradients in HNO2 that confine large
concentrations to within a few metres of the snow surface, where HNO2 is thought
to be produced heterogeneously, may reconcile different measurement results and
modelled O3 responses, but would lead to simulation results not significantly
different from those shown in Fig. 2. If there were a large ground source of HNO2
during the 2013 year, its contribution to the entire boundary layer could be
estimated with a simple eddy diffusivity model. Classic turbulent diffusion theory
relates the timescale of transport of a species (t) to a height above the ground (σ,
the standard deviation of a Gaussian diffusion plume) with an eddy diffusivity Kz:


\[ t = \frac{\sigma^2}{2K_z} \]


Eddy diffusivities vary with height, being roughly proportional to height in the
surface layer and rising to a maximum in the middle of the mixed layer. Extended
Data Fig. 6b shows, at left, model results for a stable boundary layer over relatively
smooth snow-covered terrain (ref. 64). The Kz value in the model varied up to just over
10,000 cm² s⁻¹ in the middle of a shallow boundary layer of similar height to the
boundary layer observed during UBWOS 2013. This range of Kz values is consistent
with observations during flux studies at Alert, NWT and Summit, Greenland.
Extended Data Fig. 6b shows, at right, the relative concentration profiles corresponding
to Kz values ranging from 500 to 10,000 cm² s⁻¹, obtained from the
diffusion time and the first-order loss rate due to photolysis (J_HNO2) according to

\[ \frac{[\mathrm{HNO_2}]_z}{[\mathrm{HNO_2}]_0} = \exp\left(-\frac{J_{\mathrm{HNO_2}}\,\sigma^2}{2K_z}\right) \]

when the midday HNO2 photolysis rate was 0.0016 s⁻¹ (10 min lifetime). A height
profile for HNO2 was calculated numerically from the Kz-vs-height results of ref. 64
and the profiles as functions of Kz. Under these conditions, HNO2 decreases
rapidly with height and impacts only the lowest few tens of metres of the boundary
layer. Thus, even if there were a ground source as large as that inferred in ref. 12, its
contribution to radical production integrated across the entire boundary layer
would be unlikely to significantly exceed that used in the simulations presented here.
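The two relations used in this estimate, the transport time t = σ²/2Kz and the surface-normalized HNO2 profile exp(−J σ²/2Kz), are easy to evaluate directly. The sketch below is a minimal illustration with a height-independent Kz, which is a simplification of the numerical profile described above; the function names and the chosen heights are hypothetical, while the photolysis rate and the Kz range come from the text.

```python
import math


def transport_time(height_cm, kz_cm2_s):
    """Diffusive transport time t = sigma^2 / (2 Kz), in seconds."""
    return height_cm ** 2 / (2.0 * kz_cm2_s)


def relative_hno2(height_cm, kz_cm2_s, j_hno2=0.0016):
    """HNO2 at a given height relative to the surface: exp(-J * t)."""
    return math.exp(-j_hno2 * transport_time(height_cm, kz_cm2_s))


# Relative HNO2 at 10 m and 50 m for the two ends of the quoted Kz range.
profile = {
    kz: [relative_hno2(h_m * 100.0, kz) for h_m in (10.0, 50.0)]
    for kz in (500.0, 10000.0)
}
```

A photolysis rate of 0.0016 s⁻¹ corresponds to a lifetime of 625 s, or about 10 min, so even at the largest Kz a molecule needs roughly two lifetimes to reach 50 m, consistent with the statement that HNO2 is confined to the lowest few tens of metres.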
Extended Data Figure 1 | Model constraints on NOx and radical precursors
derived from NOx. a, ClNO2 observations (red) and model treatment (blue).
b, UBWOS 2012 HNO2 average diurnal observations used to constrain model
HNO2. c, d, NO (c) and NO2 (d) observations (red) and model values (blue)
using fixed NO emission into the model and the nitrogen partitioning
calculated by the chemistry scheme. The data for primary emissions (for
example NOx or CH4) are subject to large variation owing to the influence of
local sources that produce large, transient spikes. The model, which has the
continuous emission characteristic of the basin-wide total, does not capture the
transients but does capture the average. This average agreement for total NOx
can be seen in the histogram of model deviation (e), which shows the
frequency of model percentage deviation (grey) between each model and
observation data point (both on a 10 min average). The orange line is a
Gaussian fit to these data, centred on 0% deviation.
Extended Data Figure 2 | Derivation of photolysis rates from pyranometer
data in 2013. a, b, Observed j(NO2) (a) and j(O¹D) (b) measured via filter
radiometer (black) during UBWOS 2012, with calculated photolysis
frequencies, using a total downwelling radiation measurement, for UBWOS
2012 (red) and UBWOS 2013 (green). c, d, TUV-calculated j(NO2) (c) and
j(O¹D) (d) for a surface albedo of 0.1 (purely downwelling radiation; dashed
blue) and for a surface albedo of 0.85 (solid blue).
[Pie chart, panel a. Segment labels: HCHO → 2HO2 + CO (34.1%); methyl-glyoxal → CH3CO3 + HO2 + CO; other keto-aldehyde photolysis; mono-aldehyde photolysis; glyoxal → 2HO2 + 2CO (2.1%); O3 + alkene (1.8%); ClNO2 → Cl (1.0%); primary HNO2; other.]

Extended Data Figure 5 | Detailed radical sources and losses. a, Radical source contributions for day six in the model simulation. The carbonyl radical sources
are separated by carbonyl moiety. b, Radical loss mechanisms on day six within the model.
are separated by carbonyl moiety. b, Radical loss mechanisms on day six within the model. 
vertical gradients. a, Diurnally averaged HNO2 mixing ratios from the 2012
(DOAS and CIMS) and 2014 (DOAS) studies. Grey and yellow shaded regions
represent average durations of night and day, respectively. b, Left: modelled …
normalized to its concentration at the surface, as a function of height above
ground level for a series of eddy diffusivities. The black dashed line corresponds
to the left-hand graph.
Extended Data Table 1 | Observed species used to inform the box model analysis of the ozone photochemistry during UBWOS 2013

Observed species | Model treatment | Observational technique
1,2,3-Trimethylbenzene | Constant emission | PTR-MS with speciation from 2012 GC-MS
1,2,4-Trimethylbenzene | Constant emission | PTR-MS with speciation from 2012 GC-MS
1,3,5-Trimethylbenzene | Constant emission | PTR-MS with speciation from 2012 GC-MS
2-Butanone | Model calculated | PTR-MS
2-Ethyltoluene | Constant emission | PTR-MS with speciation from 2012 GC-MS
2,2-Dimethylbutane | Constant emission | GC-FID
2,2-Dimethylpropane | Constant emission | GC-FID
3-Ethyltoluene | Constant emission | PTR-MS with speciation from 2012 GC-MS
Σ(2-, 3-Methylpentane) | Constant emission (lumped as 3-Methylpentane) | GC-FID
3,5-Diethyltoluene | Constant emission | PTR-MS with speciation from 2012 GC-MS
5-Ethyl-m-xylene | Constant emission | PTR-MS with speciation from 2012 GC-MS
Benzene | Constant emission | GC-FID
C2H2 | Constant emission | GC-FID
C2H4 | Constant emission | GC-FID
C2H6 | Constant emission | GC-FID
C3H6 | Constant emission | GC-FID
C3H8 | Constant emission | GC-FID
CH3CHO | Model calculated | PTR-MS
CH3OH | Model calculated + constant emission | PTR-MS
CH4 | Constant emission | Cavity ringdown spectroscopy
ClNO2 | Constrained to 2013 observed average diurnal | I− CIMS
CO | Fixed at 182.84 ppbv | UBWOS 2012 campaign mean
Cyclohexane | Constant emission | GC-FID + scaled UBWOS 2012 data
Ethyl-benzene | Constant emission | PTR-MS with speciation from 2012 GC-MS
HCHO | Model calculated + constant emission | PTR-MS
HNO2 | Constrained to 2012 observed average diurnal | Acid-CIMS
i-C4H10 | Constant emission | GC-FID
i-C5H12 | Constant emission | GC-FID
Isopropylbenzene | Constant emission | PTR-MS with speciation from 2012 GC-MS
m-Xylene | Constant emission | PTR-MS with speciation from 2012 GC-MS
n-C4H10 | Constant emission | GC-FID
n-C5H12 | Constant emission | GC-FID
n-C6H14 | Constant emission | GC-FID
n-C7H16 | Constant emission | GC-FID
NO | via NO emission | Cavity ringdown spectroscopy
NO2 | via NO emission | Cavity ringdown spectroscopy
o-Xylene | Constant emission | PTR-MS with speciation from 2012 GC-MS
O3 | Model calculated | Cavity ringdown spectroscopy
Acetyl peroxynitrate | Model calculated | I− CIMS
Propionyl peroxynitrate | Model calculated | I− CIMS
Propylbenzene | Constant emission | PTR-MS with speciation from 2012 GC-MS
Toluene | Constant emission | PTR-MS
Extended Data Table 2 | Chemical and radiation measurements
used in this analysis for modelling of UBWOS 2013 ozone events

Species | Technique | Accuracy
NO, NO2, NOy, O3 | CRDS | 5%
NO3, N2O5 | CRDS | 20%
ClNO2 | I− CIMS | 30%
CH2O | PTR-MS | 30%
Speciated PANs | I− CIMS | 15%
Inorganic acids (HCl, HNO2, HNO3) | Acetate CIMS | 30%
Speciated VOC | GC-FID | 20%
Speciated VOC | PTR-MS | 25%
Speciated VOC | PTR-Tof-MS | 25%
CH4 | CRDS | 2 ppbv
Downwelling radiation | Spectral pyranometer | T™%
Extended Data Table 3 | Radical sources in the MCM simulation on day six

Radical source | Daily production (ppbv) | % of total
O(¹D) + H2O | 0.74 | 4.0
ClNO2 photolysis | 0.18 | 1.0
HNO2 photolysis | 1.58 | 8.6
O3 + alkene | 0.34 | 1.8
H2CO photolysis | 6.29 | 34.1
Other carbonyl photolysis | 9.32 | 50.5
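The percentages in Extended Data Table 3 can be cross-checked against the daily production values with a short calculation. The grouping of the first four rows as "primary" sources is an interpretation made for this example, based on the text's description of carbonyl photolysis as radical amplification; the variable names are hypothetical.

```python
# Daily radical production on day six (ppbv), as listed in Extended Data Table 3.
sources_ppbv = {
    "O(1D) + H2O": 0.74,
    "ClNO2 photolysis": 0.18,
    "HNO2 photolysis": 1.58,
    "O3 + alkene": 0.34,
    "H2CO photolysis": 6.29,
    "Other carbonyl photolysis": 9.32,
}

total = sum(sources_ppbv.values())

# Percentage share of each source, rounded to one decimal place as in the table.
shares = {name: round(100.0 * value / total, 1)
          for name, value in sources_ppbv.items()}

# Assumed grouping: the non-carbonyl rows are taken as the primary sources.
primary_rows = ("O(1D) + H2O", "ClNO2 photolysis", "HNO2 photolysis", "O3 + alkene")
primary_total = sum(sources_ppbv[name] for name in primary_rows)
```

Under that grouping the primary sources sum to 2.84 ppbv, in line with the net primary radical source of 2.8 p.p.b.v. d⁻¹ quoted in the Methods text.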
Helium and lead isotopes reveal the geochemical
geometry of the Samoan plume


M. G. Jackson, S. R. Hart, J. G. Konter, M. D. Kurz, J. Blusztajn & K. A. Farley


Hotspot lavas erupted at ocean islands exhibit tremendous isotopic
variability, indicating that there are numerous mantle components
hosted in upwelling mantle plumes that generate volcanism at hotspots
like Hawaii and Samoa. However, it is not known how the surface
expression of the various geochemical components observed
in hotspot volcanoes relates to their spatial distribution within the
plume. Here we present a relationship between He and Pb isotopes
in Samoan lavas that places severe constraints on the distribution of
geochemical species within the plume. The Pb-isotopic compositions
of the Samoan lavas reveal several distinct geochemical groups, each
corresponding to a different geographic lineament of volcanoes. Each
group has a signature associated with one of four mantle endmembers
with low ³He/⁴He: EMII (enriched mantle 2), EMI (enriched mantle 1),
HIMU (high μ = ²³⁸U/²⁰⁴Pb) and DM (depleted mantle). Critically,
these four geochemical groups trend towards a common region of
Pb-isotopic space with high ³He/⁴He. This observation is consistent
with several low-³He/⁴He components in the plume mixing with a
common high-³He/⁴He component, but not mixing much with each
other. The mixing relationships inferred from the new He and Pb
isotopic data provide the clearest picture yet of the geochemical
geometry of a mantle plume, and are best explained by a high-³He/⁴He
plume matrix that hosts, and mixes with, several distinct low-³He/⁴He
components.

Lavas erupted at oceanic hotspots—thought to sample melts from 
buoyantly upwelling mantle plumes—are isotopically heterogeneous
and provide definitive evidence that the Earth’s mantle is composition-
ally diverse and hosts several distinct geochemical groups. Radiogenic
isotopic studies of ocean island basalts erupted at hotspots identify a
multitude of mantle compositions that are often broadly grouped into
four endmembers with different isotopic taxonomies: EMII, EMI, HIMU
and DM. A fifth component, characterized by high ³He/⁴He ratios, is
also identified in ocean island basalts, and has been variously called
undegassed mantle, FOZO (focus zone), PHEM (primitive helium
mantle), or C (common). Unfortunately, the spatial distribution of
these mantle components within upwelling mantle plumes is difficult 
to infer using geochemical studies of surficially erupted basalts. 

Recent efforts to examine the spatial distribution of mantle plume 
geochemical heterogeneities have focused on the geographic distribution 
of isotopic compositions along the surface volcanic traces of hotspot 
tracks. These studies have yielded significant progress relating the dis- 
tribution of isotopic compositions in surface lavas to the spatial distri- 
bution of isotopic components within an upwelling mantle plume. A 
well known geographic separation of isotopic compositions occurs at 
the Hawaiian hotspot, where two parallel volcanic lineaments—Loa and 
Kea—exhibit compositions that can be isotopically resolved. The
isotopic and geographic separation of the Loa and Kea volcanic lineaments 
is suggested to reflect the spatial separation of the mantle components 
within the plume that give rise to the two isotopically distinct volcanic 
lineaments, which is possibly related to the compositional structure of
the deepest mantle. Similar observations of geographical and geochem-
ical separations of parallel volcanic lineaments have been made at other
hotspots. These observations suggest that the geochemical struc-
ture of many mantle plumes varies spatially in a systematic manner.

This paper focuses on the geochemical variability along the parallel vol-
canic lineaments of the age-progressive Samoan hotspot and the geo-
chemical structure of the underlying upwelling mantle plume. We present
36 new Pb-isotopic analyses together with new Sr, Nd and He isotopic
measurements (Supplementary Tables 1-3) on a suite of lavas from
the Samoan hotspot (Fig. 1 and Extended Data Fig. 1). In
²⁰⁶Pb/²⁰⁴Pb–²⁰⁸Pb/²⁰⁴Pb isotopic space, four isotopic groups emerge from the data
set (Fig. 2), and the four groups converge on a common region, roughly 
forming an X shape. Three of the four isotopic groups are defined by 
shield-stage lavas from each of three volcanic lineaments that tend to 
form separate groups in isotopic space: the islands and seamounts of 
the Vai volcanic lineament (dark blue shading and symbols); the Malu 
volcanic lineament (pink); and subaerial lavas from the Upo volcanic 
lineament (yellow) (Fig. 1). Rejuvenated-stage lavas are encountered 
only on the islands of Savai’i, Upolu and Tutuila (which define the Upo 
volcanic lineament), are younger than the shield-stage lavas on each 
island, and form a fourth isotopic group (turquoise). 

There are exceptions to this correspondence between geochemistry 
and geography (Methods). For example, the submarine lavas from the 
western region of the hotspot (dredged near Savai’i and Upolu islands) 
overlap with the isotopic compositions found in the Vai and Malu volcanic
lineaments further to the east and include ultra-enriched lavas unlike
those found in any of the younger and more easterly volcanism (Fig. 2); 
this enriched material includes an additional isotopic component in 
the Samoan plume. However, there are insufficient submarine samples 
from the western Samoan region to define their isotopic taxonomy or 
geographic extent, or whether the western Samoan submarine samples 
define separate geographic trends, and we exclude these submarine lavas 
from our treatment below (see Methods); this approach is similar to that 
taken in Hawaii, where the geochemical separation of the Loa and Kea 
volcanic lineaments breaks down in the western region of the Hawaiian 
chain, and it is common to exclude some or all of the western islands 
when defining the Loa and Kea lineaments. In this paper we examine
Samoan rejuvenated lavas, Vai- and Malu-lineament lavas, and the sub-
aerial shield lavas from the Upo lineament, which form four geochem-
ical groups. However, submarine western Samoan lavas are shown in
the figures for clarity. 

The four geochemical groups identified in Samoan rejuvenated lavas, 
Vai- and Malu-lineament lavas, and the subaerial shield lavas from the 
Upo lineament are clearly resolved as separate clusters in multiple iso- 
topic spaces (Fig. 2; Methods). Lavas from the four groups exhibit geo- 
chemical characteristics that are associated with the four canonical mantle 
endmembers, as follows. The Malu group has EM2 characteristics, the 
Vai group has HIMU characteristics (though the signature is dilute), 
the Upo group has geochemically depleted characteristics (not unlike 


¹Department of Earth Science, University of California Santa Barbara, Santa Barbara, California 93106-9630, USA. ²Department of Geology and Geophysics, Woods Hole Oceanographic Institution, Woods
Hole, Massachusetts 02543, USA. ³Department of Geology and Geophysics, School of Earth and Ocean Sciences and Technology (SOEST), University of Hawaii, Manoa, Honolulu, Hawaii 96822, USA.
⁴Department of Marine Chemistry, Woods Hole Oceanographic Institution, Woods Hole, Massachusetts 02543, USA. ⁵Division of Geological and Planetary Sciences, California Institute of Technology,
Figure 1 | Map of the Samoan hotspot showing the division of the hotspot
into three parallel volcanic lineaments. The volcanic lineaments define three 
geochemical groups, and the colours of the data symbols in the isotopic plots 
(see Figs 1 and 2) indicate the volcanic lineament from which a sample was 
taken. Rejuvenated lavas (turquoise shaded areas) overlie shield-stage lavas on 
Tutuila and Upolu, but rejuvenated volcanism on Savai’i is extensive and all 


Hawaiian Mauna Kea lavas; Fig. 2), and rejuvenated lavas have EM1 
characteristics (see Methods). 

These four endmember groups appear to converge in a region of 
isotopic space characterized by lavas with the highest ³He/⁴He values,
and the ellipse describing the highest-³He/⁴He lavas (20-33.8 Ra, ratio
to atmosphere) in Fig. 2 serves as a common component region for
the four Samoan geochemical groups (see Methods). In two- and three-
dimensional Pb-isotopic space, the four Pb-isotopic data groups overlap
with the common component region at the 99% confidence level (Fig. 2;
see Methods). Additionally, in Pb-isotopic space, ³He/⁴He ratios
decrease monotonically away from the common region towards the 
extremes of the four data groups located furthest from the common 
region. We quantify this relationship by calculating the distance from 
the common component in three-dimensional Pb-isotopic space, and 
earlier stages have been completely covered. However, ref. 28 infers that, like
Upolu and Tutuila, shield-stage volcanism similar to the subaerial Upo-
lineament lavas (called the ‘Fagaloa series’) underlies the extensive rejuvenated
volcanism on Savai’i. The map is modified after ref. 21. Alexa seamount is
located about 1,200 km west of Savai’i.


the distance parameter is called $D^{206/207/208\mathrm{Pb}}$ (Methods). ³He/⁴He is highest
in the common component region (low $D^{206/207/208\mathrm{Pb}}$ values) and decreases
away from the common component (high $D^{206/207/208\mathrm{Pb}}$) (Fig. 3).

The unique isotopic topology identified in Samoan lavas places impor- 
tant constraints on the distribution and mixing relationships of the var- 
ious components in the Samoan plume. The data are consistent with the 
low-³He/⁴He components (that is, =8 Ra) in the plume mixing with a
high-³He/⁴He common component, and this mixing hypothesis is sup-
ported by the relationship between ³He/⁴He and $D^{206/207/208\mathrm{Pb}}$ (Fig. 3) and
by the convergence of the four 99% confidence intervals that enclose all
possible mixing trends for each group (Fig. 2). The four low-³He/⁴He
components do not appear to mix efficiently with each other, otherwise
the four Pb-isotopic groups would be obscured. However, some mixing
among the low-³He/⁴He components has occurred, and this might explain


Figure 2 | Pb-isotopic plot showing the isotopic separation of the volcanic 
lineaments in Samoa and the convergence of the four geochemical groups 
on the high-³He/⁴He component region. The colours for each data group
are based on the geographic lineament where the samples were taken. Samples 
for which Pb-isotopic ratios were measured by high-precision techniques 
(Pb-spiked samples run by thermal ionization mass spectrometer (TIMS) and 
samples run using Tl-addition by multi-collector inductively coupled plasma 
mass spectrometer (MC-ICP-MS)) are shown as large symbols (where 
estimated external uncertainties are smaller than the symbols). Unspiked 
Pb-isotopic TIMS data are shown as small symbols (where estimated 2σ
external uncertainties are better than ±0.019, ±0.023 and ±0.076 for
²⁰⁶Pb/²⁰⁴Pb, ²⁰⁷Pb/²⁰⁴Pb and ²⁰⁸Pb/²⁰⁴Pb, respectively; see Methods); example
2σ external uncertainties for unspiked Pb-isotopic TIMS data are shown.
Subaerial Upo-lineament lavas trend towards a depleted component not unlike
that found in mid-ocean ridge basalt (MORB) or Hawaiian lavas from the
HSDP-2 (Hawaiian Scientific Drilling Program-II) drill core; Alexa is a
volcano in the western Samoan region that anchors the DM isotopic group 
in the Samoan suite (Methods). The submarine lavas from the western region of 
the hotspot—dredged off the coast of Savai’i and from the Tisa seamount—are 
excluded from the statistical treatment; although the lavas from the 
submarine portion of the western Samoan islands are shown with the same 
symbol (yellow diamonds), this does not imply that they are related by a 
common process or part of the same volcanic lineament. Vai-lineament lavas 
host a dilute HIMU component, Malu-lineament lavas host an EM2
component, and rejuvenated lavas sample an EM1 component. The
high-³He/⁴He common component region (grey ellipse) defines the 2σ
variance around the average in the Pb-isotopic compositions for samples with
³He/⁴He > 20 Ra; the coloured shading represents 99% confidence intervals
around the best-fit lines through each data group (see Methods). See 
Supplementary Table 4 for a compilation of the Samoan data shown. 
Figure 3 | Relationship between He and Pb isotopic ratios in Samoan lavas.
³He/⁴He is plotted versus $D^{206/207/208\mathrm{Pb}}$, which represents distance from the
common component region in Pb-isotopic space (as defined in the Methods),
and shows that ³He/⁴He decreases in samples moving away from the
common component region. Samples with <10- ? cm³ of ⁴He (at standard
temperature and pressure) per gram of sample are excluded. The different 
isotopic groups do not overlap perfectly, suggesting that the different 
endmembers have different He/Pb ratios. Errors for helium measurements 
are smaller than the data symbols. Symbols are the same as Fig. 2. See 
Supplementary Table 4 for a compilation of the Samoan data shown. 


some of the scatter in the isotopic groups; several Vai-lineament lavas 
have the isotopic composition expected for lavas from the Malu linea- 
ment, and vice versa, but overall such ‘cross-fertilization’ among the 
low-³He/⁴He components is limited (Methods).

We propose a conceptual model for the geochemical geometry of the 
Samoan plume that is consistent with the mixing relationships suggested 
by the observed isotopic topology. In the model, several low-³He/⁴He
components are hosted in a plume matrix composed of high-³He/⁴He
material, so that each of the low-³He/⁴He components can mix with the
high-³He/⁴He plume matrix (Fig. 4). The low-³He/⁴He components
must be sufficiently isolated from each other within the plume that they 
do not easily mix, either as solids or as liquids. To achieve the geographic 
separation of the Vai and Malu lineaments, we suggest that bilateral 
heterogeneity, like that proposed for the Hawaiian plume, must also exist
in the Samoan plume: the component responsible for the Vai lineament 
must be located on the northern side of the plume and the component 
generating the Malu lineament must be located on the southern side 
(Fig. 4). The components are separated by sufficient distance within the 
Samoan plume that they do not mix efficiently. The Upo-lineament com- 
ponent was located higher in the Samoan plume than the Vai and Malu 
components, because most Upo-lineament lavas were erupted before 
onset of Vai- and Malu-lineament volcanism (Fig. 4). Components that 
have geochemical fingerprints similar to those identified in Malu and 
Vai volcanic lineaments are also identified in the submarine portion of 
the western Samoan islands of Savai’i and Upolu, suggesting that
the EM2 and HIMU geochemical components show up periodically 
in the plume and are not strictly limited to the Malu and Vai volcanic 
lineaments. Finally, rejuvenated lavas may sample a component located 
in the Samoan plume that is underplated on the mantle lithosphere 
beneath Samoa, or a component hosted in the mantle lithosphere beneath
the Samoan hotspot, and is therefore not shown in Fig. 4. However,
if the rejuvenated component is hosted in the mantle lithosphere, an 
important question is how it survives the high melt flux during shield- 
stage volcanism. 

Preservation of four low-³He/⁴He components in the upwelling Samoan
plume, as indicated by the unique isotopic topology identified in Samoan
lavas, presents an important problem. Dynamic modelling suggests that
components embedded in a plume matrix will be ‘stretched out’ during
plume ascent, so that the components ultimately resemble spaghetti


Figure 4 | Conceptual model of the geochemical geometry of the Samoan 
plume, as sampled by shield-stage lavas, and how it relates to the 
geochemical distinction among the parallel volcanic lineaments. Two 
snapshots of the Samoan plume and the associated shield volcanism are shown: 
during Upo-lineament volcanism about 2 million years ago (left) and present- 
day construction of the Vai and Malu lineaments (right). The volcanic 
lineaments in the time snapshot on the right mimic the map in Fig. 1. Colours 
of the volcanic lineaments and related mantle components in the plume 
correspond to the colours in the legend in Fig. 2. 


with their long axes oriented in a direction parallel to plume motion:
the stretching has been shown to preserve the initial separation of com-
ponents within a plume, so that their spatial relationships in the plume
are preserved during upwelling from the deep mantle. In this way, the
different components are not mixed chaotically in the ascending plume 
conduit, and it is possible for various components to remain isolated 
from each other within the plume matrix. Alternative plume structures 
have been discussed for the Hawaiian plume in which the high-³He/⁴He
material is at the innermost core of the plume and lower-³He/⁴He com-
ponents are on the periphery of the plume, but such a geometry would
allow the low-³He/⁴He components to mix, which is difficult to recon-
cile with the isotopic topology of Samoan lavas.

In multi-isotopic space, different hotspots trend to a common region, 
called FOZO or C, characterized by high ³He/⁴He. Curiously, the
isotopic topology of Samoan lavas represents a microcosm of the global 
ocean-island-basalt data set, in which the various geochemical groups 
in multi-isotopic space converge on a common region characterized by 
high ³He/⁴He. At the global scale, this isotopic topology is consistent
with high-³He/⁴He plumes entraining low-³He/⁴He components in the
deep mantle, perhaps in the Pacific large low shear-wave velocity prov-
ince (LLSVP) that underlies Samoa, which is suggested to have high
³He/⁴He values. The low-³He/⁴He components in the Samoan plume
are associated with subducted materials (Methods)—oceanic crust, mantle
lithosphere and sediments—which may also reside in “slab graveyards”
at the bottom of the mantle where plumes originate.

We cannot rule out a model where the high-³He/⁴He component may
be located deeper than the lower-³He/⁴He components and the latter
components are incorporated by high-³He/⁴He plumes at shallower
mantle depths. However, it is not clear how the low-³He/⁴He com-
ponents in the Samoan plume would avoid significant admixture with
each other during entrainment from below, because entrained products 
will remain on the periphery of the plume during upwelling and might 
interact with subsequently entrained material. If entrainment into the 
high-³He/⁴He plume matrix occurred in the plume source, dynamic
models suggest that it must be a non-turbulent process that prevents
the low-³He/⁴He components from mixing with each other.


Online Content Methods, along with any additional Extended Data display items
and Source Data, are available in the online version of the paper; references unique
to these sections appear only in the online paper.
METHODS


New Sr, Nd, Pb and He isotopic measurements. The new Pb-isotopic analyses 
were performed at the DTM (Department of Terrestrial Magnetism) and WHOI 
(Woods Hole Oceanographic Institution). Pb chemistry at DTM employs a double
column pass and follows the method outlined in ref. 31. Pb chemistry at WHOI con-
sists of a single column pass and follows the HBr-HNO3 procedure of ref. 32 and
ref. 33. 

As described in Supplementary Table 1, Pb isotopic compositions for a subset of 
the Samoan lavas were measured on the VG-P54 MC-ICP-MS at DTM in February 
2009. Refer to ref. 34 for methods of Pb-isotopic measurement. Thallium was added 
as an internal isotopic standard to correct for instrumental mass fractionation. All 
measurements were made during a single day-long analytical session, during which 
time eight separate runs of the NIST 981 standard were made, and the reproducib-
ility of the eight measurements was 840 parts per million (p.p.m.), 820 p.p.m. and
800 p.p.m. for the ²⁰⁶Pb/²⁰⁴Pb, ²⁰⁷Pb/²⁰⁴Pb and ²⁰⁸Pb/²⁰⁴Pb ratios (2σ, standard
deviation of the mean). The Pb-isotopic data on basaltic samples run during this 
analytical session were normalized to the average value determined by the 981 ana- 
lyses that bracket the basaltic analyses over the analytical session. The data were 
normalized to the NIST 981 values reported by ref. 35: ²⁰⁶Pb/²⁰⁴Pb = 16.9356,
²⁰⁷Pb/²⁰⁴Pb = 15.4891 and ²⁰⁸Pb/²⁰⁴Pb = 36.7006.
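
The normalization step described above can be illustrated with a short sketch. It assumes a simple multiplicative correction per ratio (measured sample ratio times the accepted NIST 981 value divided by the session-average measured NIST 981 value); the text does not spell out the exact form used, and the standard and sample measurements below are hypothetical numbers chosen only for illustration.

```python
# Accepted NIST 981 ratios quoted in the text (ref. 35).
NIST981_ACCEPTED = {"206/204": 16.9356, "207/204": 15.4891, "208/204": 36.7006}


def normalize_to_standard(sample, standard_runs, accepted=NIST981_ACCEPTED):
    """Scale each sample ratio by accepted / mean(measured standard).

    ASSUMPTION: a multiplicative per-ratio correction; this is a common
    convention, not necessarily the exact procedure used by the authors.
    """
    corrected = {}
    for ratio, measured in sample.items():
        mean_std = sum(run[ratio] for run in standard_runs) / len(standard_runs)
        corrected[ratio] = measured * accepted[ratio] / mean_std
    return corrected


# HYPOTHETICAL session data, for illustration only.
standard_runs = [
    {"206/204": 16.9340, "207/204": 15.4870, "208/204": 36.6950},
    {"206/204": 16.9350, "207/204": 15.4880, "208/204": 36.6970},
]
sample = {"206/204": 19.1800, "207/204": 15.5900, "208/204": 39.2000}

corrected = normalize_to_standard(sample, standard_runs)
for ratio, value in corrected.items():
    print(ratio, f"{value:.4f}")
```

With this convention, a sample that measured exactly the session-average standard value would be mapped onto the accepted NIST 981 ratios.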

The remaining Pb-isotopic measurements reported in Supplementary Table 1 
were made on the Neptune multicollector ICP-MS at WHOI over six separate ana-
lytical sessions from November 2003 to October 2008. Note that many of the samples
measured for Pb isotopes at WHOI were previously measured by lower-precision
TIMS methods (that is, unspiked analyses), and we report new, higher-precision
analyses here. Again, a Tl internal standard was used to correct for instrumental
mass fractionation. The long-term external reproducibility on NIST 981 standard
runs at WHOI is <120 p.p.m. for the ²⁰⁶Pb/²⁰⁴Pb, ²⁰⁷Pb/²⁰⁴Pb and ²⁰⁸Pb/²⁰⁴Pb
ratios. Basaltic sample runs were normalized to the average NIST 981 value mea-
sured in a given analytical session using NIST 981 values reported by ref. 35.

Supplementary Table 2 includes new ⁸⁷Sr/⁸⁶Sr and ¹⁴³Nd/¹⁴⁴Nd isotopic measure-
ments. Analyses made at WHOI follow chemical separation techniques outlined in
ref. 29; Sr and Nd isotopic measurements were made on the Neptune multicollec-
tor ICP-MS at WHOI, and standard normalization and long-term external pre-
cision are described in ref. 29. The Sr and Nd isotopic analyses made at the Scripps
Institution of Oceanography follow analytical techniques outlined in ref. 14. 

Supplementary Table 3 includes new helium isotopic measurements, which were
measured at WHOI and at the Scripps Institution of Oceanography. Measurements
made at WHOI follow the methods outlined in ref. 37. All data were obtained by 
crushing olivines or glasses in vacuum, except for one sample, which is a measure- 
ment of gas released by fusion following a crushing experiment. In-run precision is 
reported in Supplementary Table 3. Measurements made at the Scripps Insti-
tution of Oceanography follow the methods outlined in ref. 14, where uncertainty
in ³He/⁴He ratios is estimated to be ±0.5 Ra.
Calculating distance in three-dimensional Pb-isotopic space. The expression for
distance in Pb multi-isotopic space, here called $D^{206/207/208\mathrm{Pb}}$, is given by the following
relationship:

$$
D^{206/207/208\mathrm{Pb}} = \left[ \left( \frac{(^{206}\mathrm{Pb}/^{204}\mathrm{Pb})_S - (^{206}\mathrm{Pb}/^{204}\mathrm{Pb})_R}{X} \right)^2 + \left( \frac{(^{207}\mathrm{Pb}/^{204}\mathrm{Pb})_S - (^{207}\mathrm{Pb}/^{204}\mathrm{Pb})_R}{Y} \right)^2 + \left( \frac{(^{208}\mathrm{Pb}/^{204}\mathrm{Pb})_S - (^{208}\mathrm{Pb}/^{204}\mathrm{Pb})_R}{Z} \right)^2 \right]^{0.5}
$$

where the subscript R indicates the isotopic composition of the reference sample, the subscript S is the
isotopic composition of any Samoan lava sample, and X, Y and Z represent the ab-
solute difference between the maximum and minimum values measured in Samoan
hotspot lavas (including rejuvenated lavas and all lavas from the Vai, Malu and Upo
lineaments) for ²⁰⁶Pb/²⁰⁴Pb (X = 19.4993 − 18.5720 = 0.9273), ²⁰⁷Pb/²⁰⁴Pb (Y =
15.6510 − 15.5538 = 0.0972) and ²⁰⁸Pb/²⁰⁴Pb (Z = 39.8620 − 38.6929 = 1.1691),
respectively. The reference isotopic composition is chosen to be the highest-³He/⁴He
lava from Samoa (Ofu-04-06), which plots near the region of convergence of the
four isotopic groups.
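
The distance expression above translates directly into a few lines of code. The sketch below uses the X, Y and Z ranges quoted in the text; the reference and sample compositions are hypothetical placeholders (the measured composition of Ofu-04-06 is not given in this section), and the function and key names are illustrative.

```python
import math

# Normalizing ranges quoted in the text (maximum minus minimum in Samoan lavas).
RANGES = {"206/204": 0.9273, "207/204": 0.0972, "208/204": 1.1691}


def pb_distance(sample, reference, ranges=RANGES):
    """Range-normalized Euclidean distance in three-dimensional Pb-isotopic space."""
    return math.sqrt(
        sum(((sample[k] - reference[k]) / ranges[k]) ** 2 for k in ranges)
    )


# HYPOTHETICAL compositions, standing in for the reference lava and one sample.
reference = {"206/204": 19.19, "207/204": 15.57, "208/204": 39.20}
sample = {"206/204": 18.90, "207/204": 15.60, "208/204": 39.00}

print(round(pb_distance(sample, reference), 3))
```

By construction, a sample that differs from the reference by the full Samoan range in a single ratio lies at a distance of 1, whichever ratio it is.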

The expression for distance ($D^{206/207/208\mathrm{Pb}}$) in Pb-isotopic space is based on the
Pythagorean theorem, where the differences between each Pb-isotopic ratio and
the common component are squared, these squared differences are then summed,
and the square root of the sum is taken. The expression for distance in Pb-isotopic
space (that is, $D^{206/207/208\mathrm{Pb}}$) also accounts for the fact that different Pb-isotopic ratios
exhibit dramatically different variability. To do this, the difference in the Pb-isotopic 
ratio between two data points is divided by the total range measured in Samoan 
lavas. For example, the total range in ²⁰⁶Pb/²⁰⁴Pb in Samoan lavas is 0.9273, which
is the difference between the highest measured ²⁰⁶Pb/²⁰⁴Pb ratio (19.4993) and the
lowest ratio (18.5720). Thus, the ²⁰⁶Pb/²⁰⁴Pb ratio exhibits 5.0% variability in Samoan
lavas. This is important because ²⁰⁸Pb/²⁰⁴Pb varies from 39.8620
to 38.6929 in Samoan lavas, which represents ~3.0% variability, while ²⁰⁷Pb/²⁰⁴Pb exhibits
only 0.6% variability (15.6510 to 15.5538 in Samoan lavas). If the distance equation
did not normalize the isotopic difference between data points by the total range in 
the isotopic ratio of interest, then the distances calculated would be dominated by 
the ²⁰⁶Pb/²⁰⁴Pb and ²⁰⁸Pb/²⁰⁴Pb variability, and ²⁰⁷Pb/²⁰⁴Pb (which has limited
variability in Samoa and in the ocean-island-basalt mantle in general) would con- 
tribute very little to the overall distance calculated. 
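
The argument for range normalization can be made concrete with the extreme values quoted above. This is a minimal sketch, assuming the quoted percentages are the total range divided by the minimum value (the text does not state the denominator); the variable names are illustrative.

```python
# Minimum and maximum ratios measured in Samoan lavas, as quoted in the text.
EXTREMES = {
    "206Pb/204Pb": (18.5720, 19.4993),
    "207Pb/204Pb": (15.5538, 15.6510),
    "208Pb/204Pb": (38.6929, 39.8620),
}


def percent_variability(lo, hi):
    """Total range as a percentage of the minimum (ASSUMED denominator)."""
    return 100.0 * (hi - lo) / lo


variability = {k: percent_variability(lo, hi) for k, (lo, hi) in EXTREMES.items()}

# Without normalization, consider two points separated by the full range in
# all three ratios: each ratio's share of the squared distance is its squared
# range divided by the sum of the squared ranges.
squared = {k: (hi - lo) ** 2 for k, (lo, hi) in EXTREMES.items()}
raw_share = {k: v / sum(squared.values()) for k, v in squared.items()}

for k in EXTREMES:
    print(f"{k}: {variability[k]:.1f}% variability, "
          f"{100 * raw_share[k]:.1f}% of unnormalized squared distance")
```

In this unnormalized case ²⁰⁷Pb/²⁰⁴Pb contributes well under 1% of the squared distance, whereas dividing each difference by its range gives the three ratios equal weight.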

We emphasize that the different isotopic groups converge on a common region
rather than a point in Pb-isotopic space. No single sample in the existing Samoan
data set, including Ofu-04-06, is ideally suited as a point of convergence for all the
geochemical groups in Samoa. However, given the limited available data set on
high-³He/⁴He Samoan lavas, and given the observation that Ofu-04-06 plots near
the region of convergence of the different isotopic groups, we choose the isotopic
composition of this lava as a reference isotopic composition in the $D^{206/207/208\mathrm{Pb}}$ equa-
tion because it has the highest ³He/⁴He.
Statistical test for the convergence of the four Pb-isotopic groups. The relation-
ship between ³He/⁴He and distance from the common component region in Pb-
isotopic space—where ³He/⁴He decreases with increasing distance (higher $D^{206/207/208\mathrm{Pb}}$
values) from the common component region—is consistent with a model in which
the four low-³He/⁴He components in the Samoan plume mix with a common high-
³He/⁴He component. To evaluate whether the four different isotopic groups—
represented by lavas from the Malu, Vai and subaerial-Upo lineaments and rejuvenated
lavas (Extended Data Fig. 1)—converge on the high-³He/⁴He common component
region in Pb-isotopic space, we first define the high-³He/⁴He common component
region to be comprised of all Samoan lavas with ³He/⁴He > 20 Ra. Since many sam-
ples measured for Pb isotopes were not suitable for ³He/⁴He measurements, this
allows us to compensate for potential under-sampling of the high-³He/⁴He com-
mon component and to define uncertainty in the composition of the common com-
ponent. We model the compositional range of the common component region with
the mean and the variance of the Pb-isotope data (and Nd-isotopic data) for sam-
ples with ³He/⁴He > 20 Ra (see Fig. 2 and Extended Data Figs 2 and 3). This results
in an ellipsoid centred on the mean value and with major axes defined by the 2σ
variations in the heavy radiogenic isotopic compositions. Subsequently, we test 
whether the four data groups indeed converge on this ellipsoidal common com- 
ponent region, taking advantage of the linearity of mixing relationships in Pb- 
isotopic space. To model the possible orientations of mixing lines for each group, 
we use best-fit trends through each group and their related confidence intervals. 
We computed Working-Hotelling confidence intervals at the 99% confidence level 
to be as inclusive as possible in visualizing possible mixing lines for each group (we 
tested 95% confidence as well, and the conclusions are the same). In three-dimensional 
Pb-isotopic space, the 99% confidence intervals (they appear as ‘tubes’ in three- 
dimensional isotopic space; Fig. 2 and Extended Data Figs 2 and 3) around each of 
the best-fit trend lines overlap with the ellipsoid that encompasses the common 
component region that is defined by the highest-³He/⁴He lavas. This observation
shows that, for each data group, there exists a family of mixing lines (within error 
of the best-fit line) that statistically overlaps with our common component region. 
This isotopic overlap also maps to two-dimensional Pb-isotopic spaces (see Fig. 2 
and Extended Data Fig. 3), although the three-dimensional case shows that only a 
subset of possibilities in individual two-dimensional plots is actually possible. In 
summary, all four of the best-fit trend lines through the four different isotopic groups 
statistically overlap with the common component region in Pb-isotopic space. 
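
To illustrate the kind of test described above, the sketch below computes a Working-Hotelling simultaneous confidence band for a straight-line fit in a single two-dimensional projection, then checks whether the band reaches a common-component interval. It is a simplified stand-in, not the authors' three-dimensional procedure: the data points, the centre and 2σ of the common component, and all function names are hypothetical. The F quantile uses the closed form available when the numerator has 2 degrees of freedom, so no statistics library is needed.

```python
import math


def f_quantile_num2(p, dof2):
    """Quantile of the F distribution with (2, dof2) degrees of freedom.

    For a numerator of 2 the CDF is 1 - (1 + 2x/dof2)**(-dof2/2),
    which inverts in closed form.
    """
    return (dof2 / 2.0) * ((1.0 - p) ** (-2.0 / dof2) - 1.0)


def working_hotelling_band(xs, ys, x0, confidence=0.99):
    """Return (fitted value, band half-width) of the least-squares line at x0."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    s = math.sqrt(sse / (n - 2))  # residual standard error
    w = math.sqrt(2.0 * f_quantile_num2(confidence, n - 2))
    half_width = w * s * math.sqrt(1.0 / n + (x0 - x_mean) ** 2 / sxx)
    return intercept + slope * x0, half_width


# HYPOTHETICAL data group (x = 206Pb/204Pb, y = 208Pb/204Pb).
xs = [18.90, 19.00, 19.10, 19.20, 19.30, 19.40]
ys = [39.00, 39.12, 39.19, 39.31, 39.38, 39.52]

# HYPOTHETICAL common-component centre and 2-sigma extent in y.
centre_x, centre_y, two_sigma_y = 19.19, 39.20, 0.05

fit, half = working_hotelling_band(xs, ys, centre_x)
overlaps = abs(fit - centre_y) <= half + two_sigma_y
print(f"fit = {fit:.3f}, half-width = {half:.3f}, overlaps = {overlaps}")
```

Because the band is simultaneous over the whole line, it bounds every candidate mixing line consistent with the fit at the chosen confidence level, which is why it suits a check of whether any such line can pass through the common component region.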

However, we note that submarine lavas from western Samoa (that is, lavas dredged
off the coast of Savai’i and Upolu) are excluded from this statistical test. Available
data from these lavas do suggest that they follow the relationship between ³He/⁴He
and distance from the common component region in Pb-isotopic space (Fig. 2), 
which is consistent with mixing with the common component region. However, 
the limited sampling of the submarine portion of the western Samoan region makes 
it difficult to assign the few available samples to geographic groups. For example,
only Savai’i (four dredges: ALIA114, ALIA115, ALIA116 and ALIA128) and the 
Tisa seamount (one dredge; ALIA113) have samples of the deep submarine por- 
tions of the western Samoan region, and several of the dredges are distal to the Upo 
volcanic lineament. Thus, the vast submarine region in western Samoa, which spans 
around 200 km (from Savai’i to Tisa), is represented by only five submarine dredges, 
of which several were distal to Savai’i and Upolu, which do not clearly belong to the 
Upo lineament (indeed, the distance between the northernmost and southern-
most dredges in the western Samoa region is >130 km, which is more than twice
the distance that separates the Vai and Malu volcanic lineaments). Our statistical 
approach requires that the Samoan data are grouped along geographic trends, and 
we test whether these geographically defined data groups converge on the high-
³He/⁴He component region in isotopic space. With such poor sampling along the
submarine portion of the western Samoan islands, and the improbability that these 
dredges sample the same geographic lineament, it is not yet possible to evaluate 
geographic groups in the isotopic data sets in the submarine portion of the western 
Samoan region. 

Geographic separation of the isotopic components in Samoa. The geographic- 
isotopic groups (the Vai, Malu and Upo volcanic lineaments and rejuvenated lavas) 
are resolved in multiple isotopic spaces (Fig. 2, Extended Data Figs 2 and 3). Below, 
we describe the seamounts and islands that comprise the four geographic groups
that are resolved in isotopic space: 

(1) The Vai volcanic lineament is comprised of the islands of Ofu and Ta’u, and 
Vailulu’u, Soso, Tamai’i and Muli seamounts. In isotopic space, including ²⁰⁶Pb/²⁰⁴Pb
versus ²⁰⁸Pb/²⁰⁴Pb, ²⁰⁶Pb/²⁰⁴Pb versus ²⁰⁷Pb/²⁰⁴Pb and ²⁰⁶Pb/²⁰⁴Pb versus ¹⁴³Nd/¹⁴⁴Nd,
some of the Vailulu’u lavas trend slightly outside of the data group formed by other 
Vai-lineament volcanoes. These anomalous lavas trend in the direction of Malu- 
lineament lavas in isotopic space, indicating that Vailulu’u samples components 
similar to that found in the mantle sources of both the Vai and Malu volcanic lin- 
eaments, but that lavas from the Vai-lineament rarely sample the component hosted 
along the Malu volcanic lineament. Two Vailulu’u samples from the same dredge— 
AVON3-73-1 and AVON-3-73-12—plot squarely in the field defined by Malu-
lineament lavas, and show that, like the Hawaiian Loa and Kea volcanic lineaments,
the isotopic separation of the Malu and Vai volcanic lineaments is not perfect. Ano- 
ther lava from the Vai volcanic lineament, Ta’u sample T44, plots closer in isotopic 
space to lavas that comprise the Malu volcanic lineament. Ta’u sample T14, which 
exhibits highly unradiogenic Pb and anomalous Pb concentrations relative to other 
Ta’u lavas, may be contaminated and is not shown here.

(2) The Malu volcanic lineament is geographically displaced to the south of the 
Vai volcanic lineament and is comprised of the Malumalu, Malutut and Tulaga 
seamounts and the Masefau shield lavas of northeast Tutuila. We find that three 
Malu-lineament lavas from dredges 108 (DR108) and 109 (DR109) from the ALIA 
cruise have Vai-lineament-like geochemistry, again showing that the geo-
chemical distinction between the two parallel volcanic ridges is not perfectly resolved
in isotopic space. 

The Malu volcanic lineament joins seamlessly with the northeast region of the 
island of Tutuila. Tutuila lavas with geochemistry similar to Malu-lineament lavas, 
referred to as Masefau shield lavas, outcrop only along the north coast of the nor-
theastern region of the island along the Afono, Masefau and Sailele bays (J. Natland,
personal communication, 2013). Masefau lavas have not been encountered any- 
where else on Tutuila, with the possible exception of sample 91-TP-252, which was 
collected on the southern coast of the northeast portion of the island (however, this 
sample is a cobble, so its true provenance is unknown). Less geochemically enriched 
Tutuila lavas, referred to as Pago shield lavas, outcrop on the rest of the island.
Using radiometric age data from ref. 41, lavas identified geochemically as Pago shield 
lavas in the western region of the island are generally younger than lavas identified 
as Masefau shield lavas on the eastern side of the island (see ref. 41 and references 
therein). Indeed, the Masefau shield sequences are cross-cut by dikes with Pago 
shield geochemical signatures. Work pairing geochemistry with age data is relatively
scarce in Samoa, and further work is needed to evaluate the temporal relationship
between the Masefau and Pago shield series. However, from existing data,
we argue that Tutuila is divided between two volcanic series, and that the younger 
Pago shield series overlies much of the older Masefau series, much like the Hawaiian 
Mauna Loa series overlies the Mauna Kea series in the HSDP-2 drill core (we note
that these two volcanoes, Mauna Kea and Mauna Loa, sample the separate volcanic
lineaments along the Hawaiian hotspot, just as the Pago and Masefau shields sample
two separate volcanic lineaments in Samoa). We further argue that the two volcanic 
series on Tutuila anchor the Upo and Malu volcanic lineaments, both geochemi- 
cally and geographically. The Malu volcanic lineament joins with the northeastern 
region of Tutuila where the isotopically related Masefau lavas are encountered, and 
we argue that Tutuila Masefau lavas anchor the westernmost portion of the Malu 
lineament (and the Tutuila Pago lavas anchor the easternmost limb of the Upo 
volcanic lineament; see below). 

(3) In the Upo lineament, Pago shield lavas from Tutuila are isotopically similar 
to subaerial Upolu shield lavas (subaerial Upo-lineament lavas from Upolu have 
previously been referred to as Fagaloa shield lavas), and they outcrop on the western
and southern portions of Tutuila (and as dikes cross-cutting the Masefau shield lavas 
in northeast Tutuila). We argue that the Pago shield lavas mark the easternmost 
extent of the Upo volcanic lineament, while Tutuila Masefau lavas define the west- 
ernmost extent of the Malu volcanic lineament. Thus, Tutuila Island acts as a nexus 
between the Upo lineament (Pago shield series on southern Tutuila) and the Malu 
lineament (Masefau shield series on northern Tutuila). In contrast to all other vol- 
canic structures in Samoa, which strike along a direction parallel to absolute plate 
motion (WNW to ESE), the island of Tutuila is oriented obliquely to plate motion 
(WSW to ENE). Thus, the Malu lineament ‘steps off’ to the northeast, away from
the Upo lineament, and Tutuila effectively bridges these two volcanic lineaments. 

Shield lavas from Upolu have similar isotopic compositions to Pago shield lavas 
on Tutuila (Fig. 2, Extended Data Fig. 3), and the Upo volcanic lineament is com- 
prised of lavas from at least two islands. Fagaloa series lavas that define the subaerial 
Upo lineament also may exist on Savai’i, but if they do, they have been completely 
covered with young rejuvenated lavas.

The volcanic stage suggested for two subaerial Upolu lavas may require new designations,
based on their geochemical characteristics. Upolu samples U30L and
U10S (see ref. 17) were tentatively classified in the field as rejuvenated-stage lavas
on the basis of a geological map from ref. 43. Later isotopic analyses showed that
these two samples plot in the field of shield lavas (M. Regelous, personal communication,
2013), and we argue that these two samples belong to the shield stage. We
note that Pb-isotopic data reported on Tutuila lavas from the Pago shield (and Upolu
lavas) by ref. 44 are severely contaminated and are excluded from the present study.

(4) Rejuvenated lavas cover the Samoan islands of Savai’i, Upolu and Tutuila to
variable extents, from only minor rejuvenated volcanic cover on Tutuila
to near-complete cover of the shield stage by rejuvenated lavas on
Savai’i. In radiogenic isotopic spaces, the rejuvenated lavas form a separate field
that is resolved from shield-stage lavas, and the rejuvenated lavas from each island
are isotopically similar (Fig. 2, Extended Data Fig. 3). The origin of Samoan rejuvenated
lavas, which are volumetrically extensive, is not well understood, but is
thought to relate at least partially to enhanced melting caused by
tectonic stresses in the region generated by the nearby Tonga trench.

Geochemical identity of the five mantle components in the Samoan plume. The
identity of the four low-³He/⁴He mantle species in the Vai and Malu lineaments,
the subaerial Upo lineament and in rejuvenated lavas can be determined by examining
their geochemical characteristics. Below, we argue that the four low-³He/⁴He
components in the Samoan plume sample the canonical mantle endmembers
(EM1, EM2, HIMU and DM) in variably diluted forms. A fifth component, which
is characterized by having high ³He/⁴He, has geochemical signatures that give clues
to its origin:

(1) In the Malu lineament, the lavas have geochemical signatures associated with
the EM2 mantle endmember, including negative Ti (and Nb, not shown) anomalies
(Extended Data Fig. 4), and very high ⁸⁷Sr/⁸⁶Sr (refs 17 and 19). The earliest
shield-stage lavas on Savai’i, dredged on the deep submarine flanks of Savai’i (4.8
to 5.3 million years ago), have isotopic and trace element signatures of an even
more extreme EM2 component (with ⁸⁷Sr/⁸⁶Sr up to 0.7216) that complements
the EM2 signature that characterizes Malu lineament volcanics. The EM2
component in Samoa is best modelled by recycling subducted, continentally derived
sediment into the Samoan mantle.

(2) In the Vai lineament, lavas have geochemical signatures that are similar to the 
HIMU mantle endmember (albeit in greatly diluted form). Vai-lineament lavas have 
the highest (U + Th)/Pb (Extended Data Fig. 5) and the most radiogenic ²⁰⁶Pb/²⁰⁴Pb
(Fig. 2, Extended Data Fig. 3) in the Samoan suite. High (U + Th)/Pb is a characteristic
of HIMU lavas. Clearly, Vai-lineament lavas do not exhibit the highly
radiogenic Pb-isotopic compositions evident in HIMU endmember locations like 
Mangaia and Tubuai (for example, ref. 51), but we argue that Vai-lineament lavas 
host a dilute HIMU component that is not sampled in pure endmember form in 
Samoa, at least in lavas studied to date. The most extreme HIMU components 
in the mantle have been linked to ancient recycled oceanic crust (see, for example, 
refs 51-54). 

(3) In the Upo lineament, subaerial lavas trend towards a depleted mantle composition
in various isotopic spaces (Fig. 2, Extended Data Figs 3 and 4), and are
characterized by the highest ¹⁴³Nd/¹⁴⁴Nd in the Samoan suite. Supporting this hypothesis,
¹⁴³Nd/¹⁴⁴Nd increases along the subaerial Upo-lineament group moving away
from the high-³He/⁴He common component region (as D^{206/207/208Pb} increases) in the
direction of MORB and Hawaiian lavas from the HSDP-2 drill core (Fig. 2, Extended
Data Fig. 4). By comparison, the other three data groups exhibit relatively constant
or even decreasing ¹⁴³Nd/¹⁴⁴Nd with increasing distance from the high-³He/⁴He
common component region. (Curiously, ¹⁴³Nd/¹⁴⁴Nd exhibits the least amount of
variability in the high-³He/⁴He common component region, where D^{206/207/208Pb} is
near zero, which indicates that the common component region has relatively homogeneous
isotopic characteristics.) Thus, subaerial lavas from the Upo-lineament
group appear to sample a depleted mantle composition, a hypothesis that is supported
by the observation that average MORB (ref. 55) and Hawaii Kea lavas (which exhibit
geochemically depleted ¹⁴³Nd/¹⁴⁴Nd) anchor the data trends for Upolu shield and
Tutuila Pago lavas in all isotopic spaces. The western Samoan seamount Alexa (ref. 29)
anchors the most extreme terminus of the Samoan DM isotopic group sampled by
subaerial Upo-lineament lavas; data from this seamount are shown in all relevant
isotopic spaces and the data plot near the field for Hawaiian Kea lavas. Therefore,
the geochemical component sampled by Upo-lineament lavas is not unlike the
depleted plume component sampled by Hawaiian Kea lavas, and may be inherent
to the Samoan plume, but we cannot exclude a depleted upper mantle source (like
that which sources MORB) as the endmember sampled by Upo-lineament lavas.

(4) In the rejuvenated lavas from Samoa there is a mild EM1 component, and
this is supported by positive Ba-anomalies (high Ba/Th in Supplementary Table 4)
in Samoan rejuvenated lavas, a geochemical characteristic shared with EM1 endmember
lavas from Pitcairn. Relative to Samoan shield lavas, Samoan rejuvenated
lavas exhibit elevated ²⁰⁸Pb/²⁰⁴Pb at a given ²⁰⁶Pb/²⁰⁴Pb, a characteristic that is also
shared with EM1-like lavas globally.

(5) The variably diluted mantle endmembers identified in Samoan lavas (EM2,
EM1, HIMU and DM) are known to have relatively low ³He/⁴He values.
These low-³He/⁴He components anchor the four isotopic groups in the region of
isotopic space furthest from the high-³He/⁴He common component region. However,
lavas with relatively high to very high ³He/⁴He are found on several islands and
seamounts (up to 33.4 Ra on Ofu, 18.0 Ra on Ta’u, 25.8 Ra on Tutuila, 19.3 Ra on
Upolu, 15.9 Ra on Malumalu, 15.3 Ra on Muli and 18.6 Ra on Savai’i), and these lavas
tend to plot near the high-³He/⁴He common component region (Fig. 3; Extended
Data Fig. 6). High-³He/⁴He lavas in Samoa have high Ti/Ti* (defined in ref. 19)
values (ref. 60), in contrast to the low Ti/Ti* observed in the Samoan EM2 endmember
(Extended Data Fig. 4). Consistent with this observation, Ti/Ti* is highest near the
high-³He/⁴He common component region; moving away from the centre of the
common component region (from low to high D^{206/207/208Pb} values), Ti/Ti* decreases
in the direction of the four low-³He/⁴He endmembers (Extended Data Fig. 4). This
relationship between Ti/Ti* and ³He/⁴He is highlighted by the elevated Ti/Ti* observed
in high-³He/⁴He lavas globally.

Samoan lavas with high ³He/⁴He are melts of a mantle component with a controversial
origin, and these lavas cluster in the common component region in isotopic
space (Extended Data Fig. 6). Reference 14 argued that the high-³He/⁴He
Samoan component represents primitive, undegassed mantle, and called this component
PHEM (primitive helium mantle). However, Samoan lavas identified with
higher ³He/⁴He ratios (up to 33.8 Ra) have ¹⁴³Nd/¹⁴⁴Nd values that are geochemically depleted
relative to primitive, chondritic material, and Pb-isotopic compositions that are far
from the geochron. Nonetheless, high-³He/⁴He Samoan lavas have lower (geochemically
more enriched) ¹⁴³Nd/¹⁴⁴Nd than lavas with high ³He/⁴He from other
hotspots, and it was suggested that relatively young (about ten million years old)
sediments rapidly cycled from the Tonga trench might account for this Nd-isotopic
shift. However, the Pb-isotopic compositions in Samoan lavas preclude the existence
of a young sediment component in the Samoan mantle, and sediments from
the Tonga trench in particular are a poor fit for Samoan lavas in Pb-isotopic space
(see figure 3 in ref. 20). The origin of the enriched geochemical signatures in Samoan
high-³He/⁴He lavas is not known, but ref. 20 suggested the involvement of a DUPAL
signature (a globe-encircling band of isotopic enrichment observed in Southern
Hemisphere hotspot lavas) in the Samoan high-³He/⁴He mantle that shifts the
¹⁴³Nd/¹⁴⁴Nd to lower values than observed in Northern Hemisphere high-³He/⁴He
hotspot lavas. The origin of the geochemically enriched DUPAL signature is not
known, but might relate to ancient (about 1.8 billion years old) subducted material.
Thus the unique heavy radiogenic isotopic signature of the Samoan high-³He/⁴He
plume component may result from the incorporation of ancient subducted material
into the high-³He/⁴He mantle.

Extended Data Figure 1 | Sample locations and volcano ages. The range of
ages for each location (subaerial or submarine dredge) is provided in a box at 
the periphery of the map, and a yellow line connects each location with the 
respective age data; not all samples with geochemical data have age data 
(indeed, most Samoan samples with geochemical data, submarine and 
subaerial, do not have age constraints). Dredge locations are labelled with a red 
line: dredges from the 1999 AVON2/3 cruise aboard the RV Melville have
dredge numbers less than 100, and dredges from the 2005 ALIA cruise aboard
the RV Kilo Moana have dredge numbers greater than 100. Samples
collected on land were taken from the five Samoan islands (and are labelled with 
yellow stars: Savai’i subaerial, Upolu subaerial, Tutuila subaerial, Ta’u subaerial 
and Ofu subaerial). Malumalu and Vailulu’u seamount ages are based on 
uranium-series disequilibrium, and therefore maximum ages are provided.
Upolu subaerial lavas include both rejuvenated series (which bracket the 
younger limit of ages) and the shield series (which bracket the older limit of 
ages); poor outcrop exposure on the highly vegetated Samoan islands can
make designation of the volcanic stages difficult (particularly if geochemical 
data are not available for the hand sample), and an average age for the 
rejuvenated or shield stages on Upolu is therefore not provided. Rejuvenated 
lavas are present on Tutuila, but ages are not available in the literature. All 
reported subaerial lavas from Savai’i are rejuvenated, indicating that the island 
has been covered with a veneer of rejuvenated volcanism. Rejuvenated
volcanism has been observed during historical times on Savai’i, which was last 
active from 1905-1911 (ref. 67); error bars are not provided for the oldest 
Savai’i subaerial sample in ref. 17. Submarine samples dredged off the coast of 
Savai’i (D114, D115 and D128) and from Tisa seamount were dredged distal to 
the Upo lineament and may not belong to this lineament. All available ages 
for Samoan islands and seamounts are provided in refs 17, 18, 39, 40, 41, 65, 66, 
68 and 69. (Ma, million years.) 

Extended Data Figure 2 | A three-dimensional presentation of the Pb-isotopic
groups shows that they converge on the high-³He/⁴He common
component region. 99% confidence intervals (appearing as ‘tubes’) around the 
best-fit lines through each of the four data groups—Malu lineament (pink 
tube), Vai lineament (dark blue), subaerial Upo lineament (yellow) and 
rejuvenated lavas (light blue)—are shown in three-dimensional Pb-isotopic 
space. The composition of the common component region is modelled as an 
ellipsoid (grey) that is defined by the 2σ variance around the average in the
Pb-isotopic compositions for samples with ³He/⁴He > 20 Ra. In three-
dimensional Pb-isotopic space, the 99% confidence intervals around each of 
the best-fit trend lines overlap with the ellipsoid that encompasses the common 
component region. Each tube represents an estimate of the error around the 
best-fit trend to the data defining each geographic lineament. The tube 
therefore encloses the set of all possible mixing arrays associated with a given 
geographic lineament. Since all the tubes intersect the ellipsoid of the common 
component region, statistically a range of mixing arrays exists for each 
geographic lineament that passes through the common component region. 
This result is consistent with the compositional data of the four lineaments 
mixing with the high-³He/⁴He common component.
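
The geometric test described in this caption can be sketched numerically. The minimal Python example below is an illustration only, not the authors' procedure: it assumes that the 2σ ellipsoid is the set of points whose Mahalanobis distance from the mean of the high-³He/⁴He samples is at most 2, and it tests a single best-fit line rather than a full 99% confidence tube. The function names and any numbers passed to them are hypothetical.

```python
import numpy as np

def common_component_ellipsoid(pb_ratios):
    """Mean and covariance of Pb-isotope triplets (one sample per row).

    Assumption: the 'common component region' is summarised by the sample
    mean and covariance of the high-3He/4He lavas.
    """
    pb_ratios = np.asarray(pb_ratios, dtype=float)
    return pb_ratios.mean(axis=0), np.cov(pb_ratios, rowvar=False)

def line_enters_ellipsoid(origin, direction, mean, cov, n_sigma=2.0):
    """True if the line origin + t*direction passes within n_sigma
    (Mahalanobis distance) of the ellipsoid centre."""
    inv_cov = np.linalg.inv(cov)
    o = np.asarray(origin, dtype=float) - mean
    d = np.asarray(direction, dtype=float)
    # Parameter t that minimises the Mahalanobis distance along the line.
    t = -(d @ inv_cov @ o) / (d @ inv_cov @ d)
    closest = o + t * d
    return bool(np.sqrt(closest @ inv_cov @ closest) <= n_sigma)
```

With a unit covariance and a centre at the origin, a line that passes one unit from the centre is counted as intersecting the region, whereas a line that passes three units away is not. Extending the check from one line to a confidence tube would require testing the family of lines that the tube encloses.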

Extended Data Figure 3 | The isotopic compositions of the four Samoan data
groups are shown in Nd and Pb isotopic spaces. In both panels, the
high-³He/⁴He common component region is modelled by a grey ellipse that
defines the 2σ variance around the average of the heavy radiogenic isotopic
compositions of Samoan lavas with ³He/⁴He > 20 Ra. The left panel shows the
four data groups identified in Pb-isotopic space (Fig. 2) in a plot of ¹⁴³Nd/¹⁴⁴Nd
versus ²⁰⁶Pb/²⁰⁴Pb. Samples for which Pb-isotopic ratios were measured by
high-precision techniques (Pb-spiked samples run by TIMS and samples run
using Tl-addition by MC-ICP-MS) are shown as large symbols (where
estimated 2σ external uncertainties are smaller than the symbols),
and unspiked Pb-isotopic TIMS data are shown as small symbols (where
estimated 2σ external uncertainties are equal to or better than ±0.076 for the
²⁰⁸Pb/²⁰⁴Pb ratio, as shown). The right panel shows the four data groups
identified in Fig. 2 in a plot of ²⁰⁶Pb/²⁰⁴Pb versus ²⁰⁷Pb/²⁰⁴Pb. Samples for
which Pb-isotopic ratios were measured by high-precision techniques
(Pb-spiked samples run by TIMS and samples run using Tl-addition by
MC-ICP-MS) are shown as large symbols (where estimated 2σ external
uncertainties are smaller than the symbols, except for samples run on the P54 at
Carnegie, where estimated 2σ external precision error bars are shown on the
individual data points, as reported in the Methods); unspiked Pb-isotopic
TIMS data are shown as small symbols (where estimated 2σ external
uncertainties are equal to or better than ±0.019 and ±0.023 for ²⁰⁶Pb/²⁰⁴Pb
and ²⁰⁷Pb/²⁰⁴Pb, respectively, as shown). 99% confidence intervals around the
best-fit lines through each data group overlap with the high-³He/⁴He common
component region. Symbols are the same as in Fig. 2 of the main text. The
MORB average composition is from ref. 55. The HSDP-2 drill core data are
from refs 24 and 30. See Supplementary Table 4 for a compilation of the
Samoan data shown; Alexa data are from ref. 29.

Extended Data Figure 4 | Various geochemical signatures show clear
trends with increasing distance from the common component region in
Pb-isotopic space. The top panel shows a plot of D^{206/207/208Pb} versus Ti/Ti*.
Samoan lavas with the highest ³He/⁴He have the highest Ti/Ti* and the lowest
D^{206/207/208Pb} values; this follows from an earlier observation that Ti/Ti* correlates
with ³He/⁴He in Samoan lavas, and the high-³He/⁴He mantle reservoir has
elevated Ti/Ti* (ref. 60). Ti/Ti* is defined in ref. 19. Only lavas with
MgO > 7 wt% are shown, to avoid the effects of fractional crystallization of
trace phases that might fractionate the trace element ratios. A sample with
high MnO from Soso (ALIA110-39) is excluded owing to a high degree of
alteration. ALIA-115-07, which is highly altered, is also excluded, as are all
samples from ALIA Dredge 118. Samples with He concentrations <10° ? cm? 
of “He at STP per gram of sample (olivine) are excluded. Additionally, only 
shield-stage lavas are plotted. The middle panel shows D^{206/207/208Pb} versus
¹⁴³Nd/¹⁴⁴Nd. ¹⁴³Nd/¹⁴⁴Nd shows systematic behaviour in each data group
moving away from the common component region (that is, with increasing
D^{206/207/208Pb}) in Pb-isotope space. Data from subaerial Upo-lineament lavas
(yellow) exhibit increasing ¹⁴³Nd/¹⁴⁴Nd with increasing distance (higher
D^{206/207/208Pb}) from the common component region, and this supports the
hypothesis that the subaerial portion of the Upo lineament samples a depleted
mantle (DM) component similar to that found in the Alexa seamount and
Hawaii. The other data groups (from the rejuvenated lavas and the Vai and
Malu volcanic lineaments) all exhibit lower (more enriched) ¹⁴³Nd/¹⁴⁴Nd with
increasing distance from the common component region. Finally, the middle
panel shows that ¹⁴³Nd/¹⁴⁴Nd exhibits the least amount of variability in the
common component region, where D^{206/207/208Pb} is zero, as the four isotopic
groups converge on a common component with relatively homogeneous
isotopic characteristics. The bottom panel shows D^{206/207/208Pb} versus ³He/⁴He
(also shown in Fig. 2 of the main text) for comparison with the other panels.
Symbols are the same as in Fig. 2. The MORB average composition is from
ref. 55. The HSDP-2 drill core data are from refs 24 and 30. When calculating
Ti/Ti*, only data obtained by ICP-MS (except Ti, which is measured by X-ray
fluorescence) are used. See Supplementary Table 4 for sources of the
Samoan data; Alexa data are from ref. 29.
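
The distance plotted on the horizontal axes of these panels is defined in the Methods rather than in this caption. As a hedged illustration only, the sketch below assumes that it is the straight-line (Euclidean) distance in three-dimensional Pb-isotopic space between a sample and the centre of the common component region; the function name, the axis order and the example coordinates are hypothetical and are not taken from the data.

```python
import math

def distance_from_common_component(sample, centre):
    """Euclidean distance between two points in Pb-isotopic space.

    Both arguments are (206Pb/204Pb, 207Pb/204Pb, 208Pb/204Pb) triplets.
    Assumption: no axis scaling or normalisation is applied.
    """
    return math.dist(sample, centre)

# Made-up coordinates, for illustration only.
centre = (19.2, 15.6, 39.4)
sample = (19.5, 15.6, 39.8)
d = distance_from_common_component(sample, centre)
```

If the three ratios were scaled by their analytical uncertainties or by their natural variance before the distance is taken, the values would differ, so the scaling choice should be checked against the Methods before any comparison with the figure.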

Extended Data Figure 5 | (U+Th)/Pb versus Ba/Th. Vai-lineament lavas
exhibit the highest (U+Th)/Pb values in Samoa, consistent with a HIMU
signature. Such high (U+Th)/Pb values are consistent with the radiogenic
Pb-isotopic compositions in Vai-lineament lavas and similar to the high
(U+Th)/Pb values observed in HIMU lavas. Samoan rejuvenated lavas,
which have an EM1 signature, have high Ba/Th (and Ba/Sm and Ba/Nb);
this Ba-enrichment matches the positive Ba-anomalies observed in EM1
endmember lavas from Pitcairn. Highly altered samples and samples with low
MgO are excluded (as described in Extended Data Fig. 4). Ref. 17 identified 
Upolu sample U10 as an outlier in many isotope and trace element spaces. 
Symbols are the same as in Fig. 2 of the main text. Only data obtained by 
ICP-MS are shown. See Supplementary Table 4 for a compilation of the 
Samoan data shown. 

Extended Data Figure 6 | The ³He/⁴He ratios of Samoan lavas are shown in
colour (warmer colours represent higher ³He/⁴He) to show the distribution
of ³He/⁴He ratios in ²⁰⁸Pb/²⁰⁴Pb versus ²⁰⁶Pb/²⁰⁴Pb isotopic space. Lavas
with the highest ³He/⁴He tend to cluster near the region in Pb-isotopic space
where the four Pb-isotopic data groups converge, and lavas with lower ³He/⁴He
tend to plot farthest from the common component region. Samples with
<10 ° cm? of “He at STP per gram of sample (olivine) are excluded. 

See Supplementary Table 4 for a compilation of the Samoan data shown. 

Site-specific group selection drives locally adapted group compositions

Jonathan N. Pruitt & Charles J. Goodnight


Group selection may be defined as selection caused by the differential 
extinction or proliferation of groups. The socially polymorphic
spider Anelosimus studiosus exhibits a behavioural polymorphism
in which females exhibit either a ‘docile’ or ‘aggressive’ behavioural
phenotype. Natural colonies are composed of a mixture of related
docile and aggressive individuals, and populations differ in colonies’
characteristic docile:aggressive ratios. Using experimentally constructed
colonies of known composition, here we demonstrate that
population-level divergence in docile:aggressive ratios is driven by 
site-specific selection at the group level—certain ratios yield high sur- 
vivorship at some sites but not others. Our data also indicate that col- 
onies responded to the risk of extinction: perturbed colonies tended 
to adjust their composition over two generations to match the ratio 
characteristic of their native site, thus promoting their long-term sur- 
vival in their natal habitat. However, colonies of displaced individuals 
continued to shift their compositions towards mixtures that would 
have promoted their survival had they remained at their home sites, 
regardless of their contemporary environment. Thus, the regulatory 
mechanisms that colonies use to adjust their composition appear to 
be locally adapted. Our data provide experimental evidence of group 
selection driving collective traits in wild populations. 

In societies in which individual fitness is tightly linked with the performance
of the group, the theory of group selection predicts that evolution
will favour traits in individuals that aid in maximizing their group’s
success, which, in turn, are predicted to increase individuals’ long-term
evolutionary interests. Here we define group selection as selection caused
by the differential extinction or proliferation of groups. This represents
a broad definition that is not in any way adversarial to the importance
of kinship selection for social evolution. Although the basic idea of group
selection has intuitive appeal, its success as a general explanation of adaptive
social evolution has been marred by critiques of its reasoning and
usefulness. In this paper we provide compelling experimental evidence
that group selection drives locally adapted group compositions in
wild populations.

The social spider A. studiosus exhibits a discrete and heritable (Extended
Data Fig. 1) behavioural polymorphism in which individuals display
a ‘docile’ or ‘aggressive’ phenotype. In nature, colonies are composed
of a mixture of related docile and aggressive individuals, and the mixture
of types within colonies has large consequences for collective behaviour
and colony reproductive success. We also observe site-specific docile:aggressive
mixtures (Fig. 1a), which may reflect local adaptation, such
that different sites favour different ideal compositions. Notably, A. studiosus
exhibits high rates of colony extinction events and limited dispersal,
two attributes that are thought to increase the power of group selection
as an evolutionary force (see Supplementary Discussion 1 for more
natural history information).

To determine whether site-specific docile:aggressive mixtures are a 
result of group selection, we generated an array of artificial colonies of 
known, variable compositions and deployed them at six field sites: three 
high-resource sites (Melton Hill, Tennessee; Little River, Tennessee; and 
Moccasin Creek, Georgia) and three low-resource sites (Norris Dam, 
Tennessee; Clinch River, Tennessee; and Don Carter, Georgia) (Extended
Data Fig. 2). We determined 53 random combinations of colony size
(1-27 females) and composition (0-100% aggressive) and deployed an 
identical array at each site. Thirty-seven colonies were composed of indi- 
viduals taken from the site where they were subsequently deployed (that 
is, ‘native’ individuals), and 16 colonies were composed of individuals 
taken from a paired site of the opposing resource level (that is, ‘foreign’ 
individuals). Females assigned to experimental colonies all came from 
the same source colony. This design allowed us to test whether site of origin 
influenced selection on colony composition and/or colonies’ ability to 
approximate an optimal composition over time. We deployed these arrays 
of native and foreign colonies in three paired, reciprocal transplant experi- 
ments between high- and low-resource sites: Melton Hill was paired with 
Norris Dam, Little River with Clinch River (all in Tennessee), and Don 
Carter with Moccasin Creek (both in Georgia). If group selection is a 
major selective force that has caused local adaptation in this system, we 
predict that (1) compositions that approximate the normal mixtures that 
characterize each site will enjoy greater success, and (2) colonies should 
only be able to adaptively hone their compositions when composed of 
native individuals, because either the cues that spiders use to sense their
colonies’ ailing compositions or their responses to those cues will be
site-specific and locally adapted.
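
As a rough illustration of the design described above, the following Python sketch draws 53 random size/composition combinations (1 to 27 females, 0 to 100% aggressive), labels 37 colonies as native and 16 as foreign, and reuses the identical array at each of the six sites. How the combinations were actually randomized is not stated here, so the seed, the way aggressive counts are drawn and all names are assumptions made for the example.

```python
import random

# Site names and resource levels as listed in the text.
SITES = {
    "Melton Hill": "high", "Little River": "high", "Moccasin Creek": "high",
    "Norris Dam": "low", "Clinch River": "low", "Don Carter": "low",
}

def make_array(n_colonies=53, n_native=37, max_size=27, seed=0):
    """Return one array of colony specifications (hypothetical scheme)."""
    rng = random.Random(seed)
    origins = ["native"] * n_native + ["foreign"] * (n_colonies - n_native)
    rng.shuffle(origins)
    array = []
    for origin in origins:
        size = rng.randint(1, max_size)        # number of females, 1-27
        n_aggressive = rng.randint(0, size)    # 0-100% aggressive
        array.append({
            "size": size,
            "n_aggressive": n_aggressive,
            "pct_aggressive": 100.0 * n_aggressive / size,
            "origin": origin,
        })
    return array

# The same seed gives the identical array at every site.
design = {site: make_array() for site in SITES}
```

Fixing the seed is what makes the array identical across sites, which mirrors the statement that an identical array was deployed at each site; foreign colonies would then be stocked from the paired site of the opposing resource level.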

We monitored the success of experimental colonies over the next two 
generations and noted all instances of colony extinction. We also mon- 
itored the composition of 20 naturally occurring ‘local’ colonies at each 
site. We used these naturally occurring colonies to assess whether our ex- 
perimental colonies exhibited uncharacteristically low or high extinc- 
tion rates. 

The naturally occurring relationship between colony size and composition
differed across sites (general linear model (GLM) site × colony
size: F589 = 294.27, P < 0.0001; Fig. 1a). At our high-resource sites, small
colonies were dominated by docile females and the frequency of aggress- 
ive individuals increased with colony size. By contrast, at low-resource 
sites, small colonies were dominated by the aggressive phenotype and 
the frequency of the docile phenotype increased with colony size (Fig. 1a). 
The fact that different sites exhibit different relationships between colony 
size and composition raises the question as to why some compositions 
are site-specific or whether the observed compositions are locally adapted. 

The success of our experimental colonies depended on how well their 
starting compositions matched the naturally occurring mixture at each 
site (GLM dissimilarity: likelihood ratio (L-R) χ² = 52.81, P < 0.001).
The more similar a colony’s composition was to the naturally occurring 
local mixture, the higher its probability of surviving. Thus, although there 
was no significant relationship between colony size and composition in 
experimental colonies at the start of our experiment (Fig. 1b), there was 
a significant relationship between colony size and composition two gen- 
erations later (Fig. 1c). The surviving colonies at each site form a size/ 
composition relationship approximating those of naturally occurring 
colonies, which differed between high- and low-resource sites (Fig. 1a
versus c). Therefore, site-specific group selection, as mediated by colony 
extinction events, appears to drive the size/composition relationships that 
characterize each site. Colonies’ reproductive output was also tightly assoc- 
iated with how well they approximated the naturally occurring mixture 
at each site (GLM dissimilarity: F,297 = 15.91, P < 0.001). Experimental
colonies with compositions resembling local colonies produced nearly
ten times as many offspring colonies as those bearing moderately dissimilar
mixtures, and those with extremely dissimilar mixtures never
produced any offspring colonies.

Among the experimental colonies that survived, the compositions of
some colonies tended to move closer to the local and successful size/composition
relationship while others moved markedly further away. Shifts
in colonies’ mixtures depended on whether colonies were composed of
native versus foreign individuals (F ratio: F), 1307 = 33.35, P < 0.001; Fig. 2).
Colonies composed of native individuals tended to become more similar
to the local mixture, whereas foreign colonies became significantly
more dissimilar. Instead, foreign colonies adjusted their compositions
to more closely approximate a mixture that would have promoted their
survival had they remained at their site of origin (paired t-test: t; = 3.77,
P < 0.001). In other words, foreign colonies tracked the ideal compositions
of their home sites regardless of their contemporary environment, and
they did this despite having persisted in their new environment for
multiple generations. These findings provide compelling evidence that
the mechanisms that colonies use to regulate their compositions are themselves
locally adapted, presumably because of the survival advantages
that they confer to the colony.

Figure 1 | Site-specific group selection. a-c, Scatterplots depicting the colony
size versus composition relationship at six riparian sites. a, The naturally
occurring, local size/composition relationship. b, The array of 53 experimental
colonies of various size/composition combinations deployed at each test site
(16 points not visible owing to identical size/composition). c, The size/composition
combinations of experimental colonies that survived two
generations in the field. P values are the result of univariate regressions. These
field experiments were replicated once at each site.

Group selection is potentially a powerful and persistent force in A. 
studiosus. Natural populations of A. studiosus have characteristic ratios 
of docile:aggressive individuals, and these ratios affect colony success**. 
Here, we demonstrate that the ideal ratios of docile:aggressive indivi- 
duals vary among sites, and that naturally occurring colonies exhibit ratios 
that promote colony survival across generations. These results suggest 
that colonies have evolved to exhibit an ideal site-specific trait mixture, 
and that the differential survivorship and reproductive success of groups 
(that is, group selection'*"°) is the driving force. Notably, A. studiosus 
colonies are composed of related individuals", thus colonies differ from 
Figure 2 | Colonies can hone ailing mixtures. Graph depicting the
average change in colonies’ dissimilarity from the local mixture: high-resource 
(top panels) and low-resource sites (bottom panels). Colonies were released 
at each site and were composed of either ‘native individuals’ that were collected 
from the same site where they were subsequently deployed, or ‘foreign 
individuals’ that were collected from a paired site of the opposing resource 
levels. Positive values indicate that colonies became more dissimilar to the local 
mixture, whereas negative values indicate that colonies became more similar 
to the local mixture. P values for site-by-site comparisons are depicted on the 
graph (t-tests). Bars represent standard error of the mean. These field 
experiments were replicated once at each site. 


each other genetically. This, in turn, can help to explain the apparent 
marked evolutionary response to group selection in this system. Addi- 
tionally, our study followed groups for two generations, meaning that 
all of the original spiders died during the course of the experiment and 
at the end of the study we were sampling the behaviour of their off- 
spring’s offspring. Thus, the patterns shown here should reflect an evo- 
lutionary response to group selection, and not only those patterns of group 
selection that could lead to an evolutionary response. Our observation 
that groups matched their compositions to the one optimal at their site 
of origin (regardless of their current habitat) is particularly important 
given that many respected researchers have argued that group selection 
cannot lead to group adaptation except in clonal groups’® and that group 
selection theory is inefficient and bankrupt'””’. 

The group selection literature is frequently criticized because it is 
often not clear whether or how group selection has actually caused the 
evolution of any trait. Our data here provide evidence that A. studiosus 
has responded to group selection by evolving the capacity to avoid low- 
performing trait mixtures. Interestingly, this ability was specific to spi- 
ders collected and redeployed at their home sites, and the ability was lost 
when colonies were placed in a novel habitat. If colonies of foreign indi- 
viduals had been able to adaptively adjust their compositions regard- 
less of their contemporary environment (home versus away) this would 
have provided evidence that the ability was entirely plastic. Yet, we found 
the opposite trend: displaced colonies continued to hone their compo- 
sitions in ways that would have promoted their survival had they remained 
at their home sites. Thus, we reason that group selection has favoured 
colonies’ ability adaptively to adjust their composition, and either the
cues that colonies use to assess their ailing compositions or the actions
colony members take in response are site-specific and genetically influenced
(a gene × environment interaction). How native spiders are actually
able to adjust their composition is unknown, but plausible regulatory
mechanisms include developmental plasticity in the docile:aggressive 
phenotypes, policing of group membership, phenotype-biased dispersal, 
and/or selective cessation of reproduction. That said, we disfavour the 
first hypothesis that developmental plasticity in the docile:aggressive 
phenotypes has a large role, since the docile: aggressive distinction is both 
highly repeatable (r = 0.70) and heritable (h² = 0.66, Extended Data Fig. 1).
However, neither estimate approximates 1. Thus, we must acknowledge 
the possible role that developmental plasticity in these traits may have 
in this system. Still, we propose that aspects of behavioural plasticity in 
response to the social environment (for example, selective eviction of 
fellow group members or cessation of reproduction) are the more likely 
causes. 

Why different sites favour different ideal compositions is unknown, 
but relevant selection pressures could include social parasitism and egg 
case cannibalism*. The ecological factors associated with extinction differed
across the sites in our study (GLM site × social parasite abundance:
L-R χ² = 24.41, P < 0.001; GLM site × proportion of egg cases
cannibalized: L-R χ² = 20.93, P < 0.001). Egg case cannibalism was
associated with colony extinction at all three low-resource sites but none 
of the high-resource sites, and the abundance of social parasites (het- 
erospecific spiders) within colonies was associated with extinction at all 
three high-resource sites but none of the low-resource sites (Extended 
Data Fig. 3). Thus, the correlates of extinction are tightly linked with sites’ 
resource levels, and this could explain why sites with similar resource 
levels also exhibit similar size/composition relationships and outcomes 
of group selection. 

Our study extends a strong historical body of work on group selection 
by conducting careful experimental manipulations on natural popula- 
tions. First, there are studies that show that laboratory or domestic popu- 
lations can respond to group selection’? ”. However, laboratory studies 
typically have selection imposed by the investigator and are certainly 
not ‘natural’ settings. Second, there are studies showing that group selec- 
tion acts in natural populations or in large mesocosms, for example, work 
on harvester ants**~* or studies on water striders**. These studies con- 
firm that we cannot ignore the importance of group selection in nature. 
However, such studies are based on phenotypic selection and they have 
never documented variation in group selection across environments. 
Last, there are studies that show that there are adaptations that appear 
to be the result of multilevel selection**’’. What was missing from this 
literature until now was an experimental field study that tied all of these 
elements together. Our study shows group selection acting in a natural 
setting, on a trait known to be heritable, and that has led to a colony- 
level adaptation. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS

Collection and laboratory maintenance. Mid-instar A. studiosus were collected
along six riparian habitats: Norris Dam, Tennessee (36°13′27″ N 84°5′29″ W);
Melton Hill, Tennessee (35°53′7″ N 84°18′0″ W); Moccasin Creek, Georgia
(34°50′41.69″ N 83°35′17.11″ W); Little River, Tennessee (35°32′40″ N 84°3′1″ W);
Clinch River, Tennessee (35°53′33.46″ N 84°1′4.96″ W); and Don Carter, Georgia
(34°23′15.43″ N 83°44′47.26″ W). Colonies from Norris Dam and Melton Hill
were collected in March 2010. Colonies from the remaining sites were collected in 
February 2013. Colonies were collected by placing the colony within a cloth pillow- 
case and trimming off the supporting branches using pruning snips. Colonies were 
transported back to the laboratory at the University of Pittsburgh and dissected out 
by hand. Individual spiders were housed in 59 ml plastic delicatessen cups contain- 
ing a tangled ball of poultry wiring to facilitate web construction. Spiders were 
maintained on an ad libitum diet of termite workers and fed twice weekly until they 
reached maturity. Upon reaching maturity, the behavioural phenotype of each indi- 
vidual was determined using the established inter-individual distance test described 
later. Females were mated randomly to a male of like behaviour type from their same 
source population, but which was collected from a source colony >5 m distance. The 
average dispersal distance of this species is 30-40 cm*®. 

Inter-individual distance assay. Two females of unknown tendency were indi- 
vidually marked with fluorescent powder and placed in the centre of a clear plastic 
container (13 × 13.5 × 2.5 cm). After 24 h of settling time, we measured the distance
between them. All females that exhibited an inter-individual distance greater
than zero (that is, they were not in direct contact) were run through a second con- 
firmatory test with a known docile female (that is, one that previously exhibited an 
inter-individual distance score of zero). This test is necessary to tease apart the two 
types of females, because aggressive females demand space and chase away docile 
females. Females that exhibited an inter-individual distance <7 cm in the second 
confirmatory test were categorized as ‘docile’ and females that exhibited an inter- 
individual distance >7 cm were categorized as ‘aggressive’. Seven centimetres corre- 
sponds to a natural break in the distribution of inter-individual distance measures 
between the two phenotypes’. Inter-individual distance scores are repeatable over 
individuals’ lifetimes, heritable (Extended Data Fig. 1), and highly correlated with 
several other aggressiveness and boldness measurements”. 
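
The two-step decision rule of this assay can be sketched in code. This is a minimal illustration and not the authors' software: the function name `classify_female` and its arguments are hypothetical, and treating a confirmatory distance of exactly 7 cm as aggressive is an assumption, because the text defines only the <7 cm and >7 cm cases.

```python
from typing import Optional

THRESHOLD_CM = 7.0  # natural break between the two phenotypes reported in the text


def classify_female(first_distance_cm: float,
                    confirmatory_distance_cm: Optional[float] = None) -> str:
    """Classify a female as 'docile' or 'aggressive' (hypothetical helper)."""
    if first_distance_cm < 0:
        raise ValueError("distances must be non-negative")
    # A score of zero (direct contact) in the first test marks a docile female.
    if first_distance_cm == 0:
        return "docile"
    # Any non-zero score requires the confirmatory test with a known docile female.
    if confirmatory_distance_cm is None:
        raise ValueError("confirmatory test result required for non-zero scores")
    if confirmatory_distance_cm < 0:
        raise ValueError("distances must be non-negative")
    # Assumption: a distance of exactly 7 cm is classed as aggressive.
    return "docile" if confirmatory_distance_cm < THRESHOLD_CM else "aggressive"
```

For example, `classify_female(3.0, 9.0)` returns `"aggressive"`, whereas `classify_female(0)` returns `"docile"` without a second test.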

Although aggressive females demand 7 cm (or more) space in this assay, this does
not translate entirely cleanly to the spatial organization of females in natural colonies.
Anecdotally, aggressive females seem to position themselves on the outskirts
of colonies. And, colonies composed of all or mostly aggressive females tend to have 
fewer individuals per unit web volume. 
Colony establishment and release. Females were assigned to experimental colonies 
within 1 week of their maturation, and painted with a unique pair of coloured dots 
atop their cephalothorax using fast-drying modelling paint. Experimental colonies 
were constructed of varying sizes and compositions, ranging from 1-27 females 
and 0-100% aggressive individuals. Fifty-three mixtures were determined at ran- 
dom using a random number generator in Excel (Microsoft 2010). These same 
mixtures were deployed at each of our six study sites (total n = 318). Thirty-seven 
of these colonies were composed of individuals taken from the same source site 
where they were subsequently deployed (native individuals), and 16 colonies were 
composed of individuals taken from a paired site of an opposing resource. This pro- 
cedure allowed us to observe whether site of origin influenced selection on colony 
composition and/or individuals’ ability to approximate their optimal compositions 
over time. Sample sizes reflect a balance of feasibility of replication and our desire 
to maximize our statistical power. We used three high- versus low-resource site pairs 
to execute three reciprocal transplant experiments of identical design: Melton Hill 
(high) with Norris Dam (low), Little River (high) with Clinch River (low) and 
Moccasin Creek (high) with Don Carter (low). 
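
The layout of this design (53 mixtures, six release sites, 37 native and 16 foreign colonies per site) can be sketched as follows. This is an illustration under stated assumptions only: the mixtures in the study were drawn in Excel by a procedure the text does not detail, so the uniform draws below, the function names, and the choice of which mixtures are native are all hypothetical.

```python
import random

# Reciprocal transplant pairs named in the text (high-resource, low-resource).
SITE_PAIRS = [
    ("Melton Hill", "Norris Dam"),
    ("Little River", "Clinch River"),
    ("Moccasin Creek", "Don Carter"),
]
N_MIXTURES, N_NATIVE = 53, 37  # 37 native + 16 foreign colonies per site


def draw_mixtures(n, seed=0):
    """Draw n (colony size, number of aggressive females) pairs at random."""
    rng = random.Random(seed)
    mixtures = []
    for _ in range(n):
        size = rng.randint(1, 27)            # 1-27 females per colony
        n_aggressive = rng.randint(0, size)  # 0-100% aggressive individuals
        mixtures.append((size, n_aggressive))
    return mixtures


def build_deployments(mixtures):
    """Deploy the same mixtures at every site, tagging each colony's origin."""
    partner = {}
    for high, low in SITE_PAIRS:
        partner[high], partner[low] = low, high
    deployments = []
    for site in partner:
        for i, (size, n_aggressive) in enumerate(mixtures):
            # Assumption: the first 37 mixtures are the native ones.
            origin = site if i < N_NATIVE else partner[site]
            deployments.append({"release_site": site, "origin": origin,
                                "size": size, "n_aggressive": n_aggressive})
    return deployments


deployments = build_deployments(draw_mixtures(N_MIXTURES))
print(len(deployments))  # 53 mixtures x 6 sites = 318
```

Fixing the seed makes the hypothetical draw reproducible; the total of 318 colonies matches the figure given in the text regardless of the seed.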

Colonies were first housed in 473 ml clear plastic cups, each containing a com- 
pact ball of poultry wiring to facilitate web construction. After 7 days of web con- 
struction, spiders were given an ad libitum meal of immobilized 4-week old crickets. 
Colonies were then allotted another 5 days to construct their webs before being 
established. 

Release localities were selected using pre-existing, naturally occurring colonies 
as indicators of habitat quality. At each locality, the resident colony was removed and 
replaced with a randomly selected experimental colony. We then searched the adja- 
cent foliage 4 m around each experimental colony and removed all naturally occur- 
ring colonies. We allotted a minimum of 6 m between the placements of experimental 
colonies. This permitted us to count the number of descendent colonies produced by 
each experimental colony. Colonies that appeared in the immediate vicinity (<1 m) 
of an experimental colony were assumed to be descendants of the nearby experi- 
mental colony, since 95% of individuals disperse within 2 m of their natal webs. 
Colony monitoring. Colonies were checked every 3-4 months for the next 14-18
months and all colony extinction events were noted. We monitored the colonies at 
Melton Hill and Norris Dam for 18 months and we monitored the colonies deployed 
at the other four sites for 14 months. Colonies were deemed to have gone extinct if 
no living A. studiosus could be observed within the colony, the colony lacked fresh 
silk deposition, and no newly established offspring colonies were observed within a 
1 m radius of its original release locality, which could indicate that individuals fled 
their ailing colonies immediately before their colony collapsed. A 1 m radius is suf- 
ficient to track >95% of all dispersing A. studiosus*”’. Furthermore, dispersal routes 
of individual females are tracked with relative ease because females deposit a thread 
of dragline silk as they disperse through the environment, which literally highlights 
their dispersal routes. In this particular study we failed to observe any incipient 
colony formation associated with the extinction of our experimental colonies. This 
further indicates that there were no surviving individuals. 
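
The three extinction criteria above combine as a simple conjunction, which the following sketch makes explicit. The record type `ColonyCheck` and its field names are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class ColonyCheck:
    """Field observations for one colony visit (hypothetical record)."""
    living_spiders_seen: bool
    fresh_silk_present: bool
    offspring_colonies_within_1m: int


def is_extinct(check: ColonyCheck) -> bool:
    """A colony counts as extinct only if all three criteria from the text hold."""
    return (not check.living_spiders_seen
            and not check.fresh_silk_present
            and check.offspring_colonies_within_1m == 0)
```

A colony with no visible spiders and no fresh silk is therefore still scored as surviving if offspring colonies are found within 1 m of its release locality.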

We also recorded the number of foreign spiders (social parasites) within colonies,
the number of prey actively being consumed or struggling in a 2 min scan sample,
and the proportion of egg cases cannibalized during the height of the reproductive 
season once for each colony. These metrics were used to test for associations between 
various ecological factors/pressures and colony survival. At the end of the 14-18- 
month period, the surviving colonies were re-collected and their size and composi- 
tion was determined using the protocols described earlier. Observers were blind to 
colony composition when recording colony vitals. 
Field census of natural colonies. To ensure that the patterns observed in our experi- 
mental colonies resembled that of natural colonies at either test site, 20 natural colonies 
of varying sizes at each site were selected and monitored over 14-18 months. We 
haphazardly removed all visible residents (1-28 spiders) using an aspirator in April 
2011, determined their aggressive/docile phenotype, and returned them to their 
source colony within 48 h. We then tracked the survivorship of these colonies over 
14-18 months. The rate of colony extinction events in these colonies (20 per site) 
was compared against our experimentally reconstituted colonies, and against 15 
entirely unaltered colonies per site. This allowed us to compare the extinction rates 
of colonies that experienced differing levels of experimental invasiveness, and to 
determine whether our protocols generated unnaturally high/low extinction rates. 
Statistical methods. Data were inspected for normally distributed residuals and 
heteroscedasticity before analysis. To assess whether colonies’ size/composition rela- 
tionship differs among sites, we used a general linear model to predict the number 
of aggressive females within colonies, with colony size (total number of females), 
site identifier, and their interaction term as predictor variables, and the number of 
aggressive females as our response variable. To test whether the determinants of 
colony extinction differed among sites, we used a multiple logistic regression model 
with colony size, composition, release site, spiders’ site of origin (native versus 
foreign), colonies’ dissimilarity, dissimilarity × release site, composition × colony
size, and composition × colony size × site as predictor variables, and survival as a
binary response variable. Colonies’ dissimilarities were calculated as the distance of 
each colony from the naturally occurring regression of colony composition (num- 
ber of aggressive females) on colony size at each site. This distance reflects the 
dissimilarity of each experimental colony from the naturally occurring composition
in demographic space. To test whether colonies composed of native versus foreign 
spiders differentially shifted their dissimilarity from the naturally occurring size/ 
composition relationship, we used a matched-pairs test to compare the same colony
at its starting distance to its distance at the end of the study. We then compared the 
change in distance of colonies composed of native versus foreign individuals using 
a nested analysis of variance (ANOVA) with individual observations nested within
release site and release site designated as a random effect. We also ran separate 
t-tests for each release site independently. Finally, we determined whether colonies 
of foreign individuals continued to track the mixtures that characterize their home 
site by using an omnibus paired difference test with release site included as a random 
effect. We ran all of these analyses four times, first with the proportion of aggressive 
females as our measure of colony demography, second with the number of aggress- 
ive females as our measure of colony demography, third with the average aggres- 
siveness of colony constituents as our measure of colony demography, and fourth 
with the number of aggressive females as our measure of colony demography but 
with all singleton colonies dropped from the analyses. Nearly identical patterns of 
significance were obtained for all four analyses (Supplementary Note 1). All statistics
were conducted using JMP 10.0 (SAS Institute).
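
The dissimilarity measure described above can be sketched as follows. This is a minimal illustration, not the JMP analysis itself: it assumes that "distance" means the absolute vertical residual from an ordinary least-squares line, and the function names and the toy census data are hypothetical.

```python
def fit_line(sizes, n_aggressive):
    """Least-squares regression of number of aggressive females on colony size."""
    n = len(sizes)
    mean_x = sum(sizes) / n
    mean_y = sum(n_aggressive) / n
    sxx = sum((x - mean_x) ** 2 for x in sizes)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(sizes, n_aggressive))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def dissimilarity(size, aggressive, slope, intercept):
    """Assumed measure: absolute vertical distance from the local regression line."""
    return abs(aggressive - (slope * size + intercept))


def change_in_dissimilarity(start, end, slope, intercept):
    """Positive = colony moved away from the local mixture; negative = moved closer."""
    return (dissimilarity(end[0], end[1], slope, intercept)
            - dissimilarity(start[0], start[1], slope, intercept))


# Toy natural colonies at one site (hypothetical numbers).
slope, intercept = fit_line([2, 4, 6, 8, 10], [1, 2, 3, 4, 5])
start, end = (10, 9), (8, 5)  # (size, aggressive females) at release and re-collection
shift = change_in_dissimilarity(start, end, slope, intercept)
```

The sign convention follows the Figure 2 legend: in the toy case the colony's distance from the line falls from 4 to 1, so the shift is negative and the colony became more similar to the local mixture.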


28. Pruitt, J. N., Riechert, S. E. & Jones, T. C. Behavioural syndromes and their fitness
consequences in a socially polymorphic spider, Anelosimus studiosus. Anim. 
Behav. 76, 871-879 (2008). 

29. Pruitt, J. N., Cote, J. & Ferrari, M. C. O. Behavioural trait variants in a habitat- 
forming species dictate the nature of its interactions with and among 
heterospecifics. Funct. Ecol. 26, 29-36 (2012). 
Extended Data Figure 2 | The average group size, number of social
parasites, numbers of prey captured in colonies’ webs, and the proportion of 
egg cases cannibalized at the height of the reproductive season for three 
high-resource sites and three low-resource sites. The sites looked at were 
Don Carter (DC), Norris Dam (NR). Data presented here represent the
averages obtained from 20 randomly selected naturally occurring colonies at 
each site. Error bars represent standard error of the mean. 
cannibalized at the time of colony extinction. Colony extinction events were
Hallucigenia’s onychophoran-like claws and the case for Tactopoda


Martin R. Smith & Javier Ortega-Hernández


The Palaeozoic form-taxon Lobopodia encompasses a diverse range of 
soft-bodied ‘legged worms’ known from exceptional fossil deposits’. 
Although lobopodians occupy a deep phylogenetic position within 
Panarthropoda, a shortage of derived characters obscures their evo- 
lutionary relationships with extant phyla (Onychophora, Tardigrada 
and Euarthropoda)”**’*"’. Here we describe a complex feature in 
the terminal claws of the mid-Cambrian lobopodian Hallucigenia 
sparsa—their construction from a stack of constituent elements— 
and demonstrate that equivalent elements make up the jaws and claws 
of extant Onychophora. A cladistic analysis, informed by develop- 
mental data on panarthropod head segmentation, indicates that the 
stacked sclerite components in these two taxa are homologous— 
resolving hallucigeniid lobopodians as stem-group onychophorans. 
The results indicate a sister-group relationship between Tardigrada and 
Euarthropoda, adding palaeontological support to the neurological'’*”” 
and musculoskeletal'*”’ evidence uniting these disparate clades. These 
findings elucidate the evolutionary transformations that gave rise to 
the panarthropod phyla, and expound the lobopodian-like morpho- 
logy of the ancestral panarthropod. 

Palaeozoic lobopodians feature prominently in discussions about 
the origins of crown-group panarthropods—the extant velvet worms 
(Onychophora), water bears (Tardigrada) and euarthropods (Euarth- 
ropoda)**""”°, Although lobopodians have been regarded as onycho- 
phoran ancestors’, the presence of ‘primitive’ characters—such as a 
terminal radial mouth, unsclerotized annulated cuticle, a non-segmented 
body and terminal claws in the walking legs—suggests a deeper phylo- 
genetic position’*’’. Because lobopodians have few derived morpho- 
logical features in common with extant panarthropod phyla, there has 
been much disagreement over the precise affinities of these extinct 
organisms and their significance for the origins of the major extant 
groups? 2012142021, 

Here we describe the fine morphology of exceptionally preserved ter- 
minal claws in the Burgess Shale lobopodian H. sparsa (mid-Cambrian; 
Stage 5), and demonstrate a fundamentally similar construction in the 
claws and jaws of the extant onychophoran Euperipatoides kanangrensis. 
These new data clarify both the affinity of ambiguous lobopodians and 
the evolutionary origins of extant panarthropods. 

H. sparsa bears two types of sclerite: a pair of appendicular sclerites 
(claws) on each walking leg, and seven pairs of armature sclerites (spines) 
along the trunk (Fig. 1a). The claws form smooth curves that subtend 
an angle of 100°, and comprise a stack of three constituent elements 
(Fig. 1b-d), separated by 21° of displacement along a logarithmic curve 
denoted by the Raupian parameters” W = 3, T = 0, D = 72. The preserved
carbon film thins gradually towards the base of the claw, reflecting
a lesser degree of sclerotization.
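
To make the geometry concrete, the size change implied by an angular displacement along such a curve can be sketched numerically. This is an illustration under stated assumptions: W is taken to be Raup's whorl expansion rate (the radius multiplies by W per full revolution), T = 0 is taken to mean a planar spiral, and D is ignored; the function names are hypothetical.

```python
import math


def scale_factor(W: float, displacement_deg: float) -> float:
    """Radius ratio between two points displacement_deg apart on the spiral.

    Assumes r(theta) = r0 * W ** (theta / 360), i.e. W-fold growth per revolution.
    """
    return W ** (displacement_deg / 360.0)


def spiral_point(r0: float, W: float, theta_deg: float):
    """Cartesian coordinates of a point on the planar (T = 0) logarithmic spiral."""
    r = r0 * scale_factor(W, theta_deg)
    theta = math.radians(theta_deg)
    return (r * math.cos(theta), r * math.sin(theta))


claw_step = scale_factor(3.0, 21.0)  # elements separated by 21 degrees at W = 3
```

Under these assumptions, a displacement of 21° at W = 3 corresponds to a radius ratio of about 1.066 between successive elements.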

Hallucigenia spines each comprise a stack of one to five constituent 
elements® that are separated by 1-6° along the logarithmic spiral given 
by W = 3, T = 0, D = 1.07. Spines that have been compressed obliquely 
to their plane of curvature express a smaller value of D, representing 
a preservational artefact (Extended Data Fig. 1). The surface of each 
constituent element is characterized by an ornament of regularly ar- 
ranged scales (Extended Data Fig. 2). 


Onychophorans lack armature sclerites, but possess two types of ap- 
pendicular sclerite: paired terminal claws in the walking legs, and den- 
ticulate jaws within the mouth cavity”. As in H. sparsa, claws in E. 
kanangrensis exhibit a broad base that narrows to a smooth conical point 
(Fig. 1e-h). Each terminal claw subtends an angle of 130° and comprises
two to three constituent elements (Fig. 1e-h). Each smaller element pre- 
cisely fills the basal fossa of its container, from which it can be extracted 
with careful manipulation (Fig. 1e, g, h and Extended Data Fig. 3a-g).
Each constituent element has a similar morphology and surface orna- 
ment (Extended Data Fig. 3a-d), even in an abnormal claw where 
element tips are flat instead of pointed (Extended Data Fig. 3h). The 
proximal bases of the innermost constituent elements are associated with 
pigmented tissue (Fig. 1e and Extended Data Fig. 3e-h).

The jaws of E. kanangrensis represent a modified set of trunk 
appendages** whose paired sclerites exhibit two distinct morphologies: 
the outer sclerite (Fig. 1j) resembles a claw, but has one or two accessory 
denticles on its concave edge; the inner sclerite (Fig. 1i) bears six to eight 
accessory denticles. These sclerites each comprise two stacked elements; 
the distal outline of each internal constituent element corresponds to 
the outline of its containing element enlarged by 2.4 ± 2.7% (Extended
Data Fig. 1). Proximally, the internal element is truncated with respect 
to its containing element; thus all elements terminate along a common 
basal line (Fig. 1f and Extended Data Fig. 3g). The constituent elements
of the jaw are separated by 21° of displacement along a logarithmic curve 
denoted by the parameters W = 3, T = 0, D = 8.

We regard the internal constituent elements in the claws and jaws of 
E. kanangrensis as future replacements of the outermost element. This 
is supported by the uniform shape and sculpture of the constituent ele- 
ments within both claws and jaws, the tendency of each element to increase 
in size relative to its container, the separation of elements upon mech- 
anical preparation, and the logarithmic trajectory of successive elements. 
The presence of a single constituent element in shed onychophoran 
exuviae~>* indicates that two to three elements characterize the inter- 
moult individual; this suggests that ecdysis involves discarding the out- 
ermost element, secreting a new innermost element, and extending the 
bases of all existing elements—presumably via the pigmented basal tissue. 

The constituent elements of E. kanangrensis jaws and claws are dis- 
tinct from the superimposed sclerites found in some ecdysozoans. The 
duplicated sclerites that occur in certain Palaeozoic lobopodians and 
palaeoscolecids”* represent the displacement of one individual sclerite 
by another during growth; upon completion of ecdysis, the displaced 
sclerite would have been shed. In such cases, each element is fully grown 
when it is sclerotized, so each internal element extends proximally be- 
yond the margin of its containing element; this is not the case in ony- 
chophorans. Some euarthropods, such as ostracods and spinicaudatan 
branchiopods, retain multiple exuviae after ecdysis”’; here, overlying 
moults are retained on the carapace during ontogeny, and continue to 
accumulate as the individual grows. This contrasts with the stacked 
elements in Euperipatoides, the outermost of which is shed during 
ecdysis”*. Unlike the elements of onychophoran sclerites, the overlying 
exuvia of the former crustacean carapace does not correspond mor- 
phologically with the underlying exuviae; nor does it share a common 
baseline contact with the epidermal tissue. Thus the constituent elements
of onychophoran claws and jaws neither represent superposition dur- 
ing moulting, nor the partial retention of moult exuviae; rather, they 
Figure 1 | Appendicular sclerites in H. sparsa and E. kanangrensis. a-d, H.
sparsa. a, Royal Ontario Museum (ROM) 61124, exhibiting dorsal armature 
sclerites (spines) and appendicular sclerites (claws, arrowed; image courtesy of 
J.-B. Caron); b, ROM 63051, single claw with three constituent elements, 
innermost partly dissociated, cf. e; c, d, ROM 57776, claw with three intact 
constituent elements (image courtesy of J.-B. Caron). e-j, E. kanangrensis 
(Onychophora, Recent). e-h, Claws; e, air bubble between middle and outer 
elements; basal pigmented tissue (pig); f-h, pair of claws from single limb
(f, single claw comprising three stacked elements; g, h, single claw separated
into outer element (g) and two stacked inner elements (h)); i, j, jaw sclerites, with
two constituent elements (i, inner jaw sclerite; j, outer jaw sclerite). Scale bars,
a, 1,000 μm; b-d, 100 μm; e, 40 μm; f-j, 100 μm.


reflect the early formation of future moult elements. Being absent in 
Euarthropoda or Tardigrada (Extended Data Fig. 4a, b), this feature is 
diagnostic of Onychophora. 

An onychophoran-like mode of development is inferred for the claws 
and spines of H. sparsa, which also exhibit multiple constituent ele- 
ments, logarithmic growth from a basal accretionary zone, consistent 
morphology during growth and—verified at least on the dorsal spines— 
a scaly ornament on the proximal region. Taken together, these features 
support the homology between the claws of hallucigeniid lobopodians 
and the appendicular sclerites of extant onychophorans, also identifying 
enigmatic organic-walled microfossils as claws of stem-onychophorans 
(Extended Data Fig. 4c-e). This distinctive mode of growth suggests 
that a common process regulated the development of armature scler- 
ites and appendicular sclerites in H. sparsa, despite their different loca- 
tions. This could represent a shared evolutionary origin, perhaps as 
armour plates on an ancestral worm-like ecdysozoan”*, or the express- 
ion of limb-patterning genes in a novel location; a similar situation is 
observed in extant insects, where limb-patterning genes (for example, 
Distal-less) are associated with the development of ventral appendages 
as well as dorsal structures that may not have an appendicular origin 
(for example, wings)”. 

To test the homology of the stacked elements in onychophorans and 
H. sparsa, we analysed the evolutionary relationships of Palaeozoic lobo- 
podians. Our data matrix is informed by recent findings on the segmen- 
tal organization of the panarthropod head (Supplementary Note 1), and 
yields a substantially resolved strict consensus tree that is robust to a 
wide range of homology penalization—indicating a strong phylogen- 
etic signal. The resultant topology consistently recovers H. sparsa and 
Onychophora in a clade that ancestrally bore tall spines, characterized 
by differentiated deutocerebral appendages and sclerites constructed 
from stacked constituent elements (Supplementary Note 2, transfor- 
mation series 34-35, 10, 39)—indicating that the latter represents an 
evolutionary innovation of total-group Onychophora (Fig. 2). Palaeozoic 
lobopodians are recovered as paraphyletic?’*'*"*”', and can be broadly 
categorized according to their position relative to panarthropod crown 
groups. Aysheaia is the only taxon resolved in the stem-lineage of Pan- 
arthropoda (per refs 1, 21; contra refs 3, 11, 14); an alternative—but 
less supported—position within stem-Euarthropoda was only recov- 
ered at low concavity values (28% of those sampled; see Supplemen- 
tary Data). The results indicate a major dichotomy within Panarthropoda. 
On one side of this basal split is total-group Onychophora, defined by 
the limbless posterior extension of the lobopodous trunk, undifferenti- 
ated posterior appendages and the loss of radially symmetrical circu- 
moral structures (Supplementary Note 2, transformation series 61, 63, 19). 
Stem-group Onychophora includes Diania (contra refs 5, 11), Xenusion, 
Paucipodia, Antennacanthopodia and all lobopodians with sclerotized 
dorsal elements except Onychodictyon ferox®. Luolishania and the 
Emu Bay Shale “Collins’ monster” occupy a derived position within 
a paraphyletic Hallucigenia grade. Antennacanthopodia and Ilyodes 
represent the closest relatives of Onychophora, indicating the second- 
ary loss of dorsal sclerotized elements in the crown group. 

On the other side of the basal panarthropod split, our analysis recov- 
ers a second major clade that includes the tardigrade and euarthropod 
Figure 2 | Panarthropod phylogeny. Strict consensus of all most
parsimonious trees recovered under equal weights (concavity constant k = ∞)
and implied weights at 99 values of k (all most parsimonious trees listed in
and implied weights at 99 values of k (all most parsimonious trees listed in 
Supplementary Data). The origin of stacked elements in sclerites is 
reconstructed assuming their absence in Cardiodictyon; a deeper origin is 
possible otherwise. Nodes are annotated with the percentage of weighting 
parameters (values of k) that support the node (values of 100% are not shown). 
Illustrated taxa are marked in bold type; the morphology of the ancestral 
panarthropod is inferred from the most parsimonious character distribution 
(Supplementary Note 2). EBS, Emu Bay Shale. 


total groups as sister taxa (per refs 10, 14, 15 and contra a more conventional
grouping of Euarthropoda + Onychophora), ancestrally
bearing radially symmetric circumoral structures, appendicules on the 
lobopodous limbs and a modified posterior trunk appendage (Supplemen- 
tary Note 2, transformation series 19, 49, 63). This result corroborates 
the Tactopoda hypothesis, which has recently been reinvigorated by the
pattern of ‘tritocerebral’ innervation of the stomatogastric ganglion, the
segmentally ganglionated nerve cord with a parasegmental organization
and the metamerically arranged longitudinal musculature shared between
these phyla. Within this framework, Onychodictyon ferox is
resolved as a stem-group tardigrade—consistent with hypotheses that 
the microscopic size of tardigrades is derived and that lobopodians 
include ancestors of this phylum. An alternative position for O.
ferox in stem-Euarthropoda was also recovered, but only at exceedingly 
low concavity values. Total-group Euarthropoda includes various dis- 
parate forms united by the ancestral presence of fused protocerebral 
appendages bearing series of spines/spinules, ultimately transformed 
into the euarthropod labrum (Supplementary Note 2, transformation
series 12-17). The gradual evolutionary transition from lobopodians
with spinose frontal appendages (Jianshanopodia, Megadictyon)
through gilled lobopodians (Kerygmachela, Pambdelurion, Opabinia) 
and anomalocaridid-type taxa (Peytoia, Anomalocaris, Hurdia) to stem 
euarthropods with full body arthrodization (for example, fuxianhuiids) 
is in overall agreement with previous reports. These relationships
reveal the parallel evolution of key innovations associated with
the origins of panarthropod phyla; for example, the independent ventral 
migration of the mouth in crown-Onychophora, Heterotardigrada
and stem-Euarthropoda, and the non-homology between the lip papillae
of Onychophora and the circumoral structures that support
Tactopoda (for example, lamellae in Tardigrada, radial mouthparts in
anomalocaridids).

The finding that sclerites with stacked constituent elements are di- 
agnostic of total-group Onychophora, in combination with a develop- 
mentally informed phylogenetic analysis, fundamentally improves the 
resolution of panarthropod relationships relative to their lobopodian 
ancestors. Consistent with the basal position of Aysheaia, Siberion and 
Onychodictyon species within their respective stem-lineages, our analysis 
indicates that the ancestral panarthropod was probably a macroscopic 
lobopodian with heteronomous body annulations, an anterior-facing 
mouth with radial circumoral papillae, and paired dorsolateral epider- 
mal specializations associated with paired lobopodous limbs that bore 
simple terminal claws (Supplementary Note 2, transformation series 31, 
18-20, 32, 1, 5, 52, 39). 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Weighting strategies. Under equal weights, each additional step in a tree topology 
is penalized equally. Taking an example from Goloboff, if transformation series A
has one step on tree X and two steps on tree Y, whereas transformation series B has 
15 steps on tree X and 14 steps on tree Y, trees X and Y each contain 16 steps in total 
and are thus treated as equally plausible. Implied weighting assumes that an addi- 
tional step in a highly homoplastic transformation series is preferable to an additional 
step in a less homoplastic transformation series: that is, it is more parsimonious to 
add a 15th step to transformation series B, which is already highly homoplasious, 
than it is to add a second step to transformation series A, which otherwise exhibits 
no homoplasy. 

Successive weighting represents an iterative solution to this problem, calculating
the homoplasy of each transformation series under an equally weighted tree, then
repeating the analysis with characters weighted by their consistency index. This
approach suffers from circularity; it generates different results from different starting
points. The method of Goloboff circumvents this problem by penalizing each
additional step in a transformation series less strongly than the step before, thus 
weighting each transformation series according to its consistency with each eval- 
uated tree. The penalty attached to subsequent steps decreases at a rate set by a con- 
cavity constant, k; the lower the concavity constant, the less weight is attached to 
subsequent steps. 

With a small concavity constant, transformation series that have one or more 
additional steps are down-weighted to the point of irrelevance. At k < 1.5, a tree X 
where transformation series A has no additional steps and transformation series B 
has six additional steps will be preferred to a tree Y where both transformation 
series have a single additional step. At k < 1, transformation series B can have any 
number of steps on tree X, and tree X will still be preferred to tree Y. Thus k < 1 corresponds
to a stance that transformation series either represent homologies or contain
no phylogenetic information at all: if a transformation has occurred more than
once, it is (almost) no more likely to appear twice than it is to appear 100 times.
This approach is overly aggressive; indeed, k < 2 is rarely seen in the literature. 
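
The thresholds quoted above can be checked numerically. The following is a minimal sketch, assuming the usual implied-weighting cost for a transformation series with e extra steps, e / (e + k); the text does not state this formula, and the function names are illustrative only.

```python
def homoplasy_cost(extra_steps, k):
    """Cost of one transformation series with `extra_steps` extra steps.

    Assumed form (not given in the text): e / (e + k), with k > 0.
    """
    return extra_steps / (extra_steps + k)


def tree_cost(extra_steps_per_series, k):
    """Total implied-weights cost of a tree: the sum over its series."""
    return sum(homoplasy_cost(e, k) for e in extra_steps_per_series)


def preferred_tree(tree_x, tree_y, k):
    """Return 'X', 'Y' or 'tie', whichever tree has the lower total cost."""
    cost_x, cost_y = tree_cost(tree_x, k), tree_cost(tree_y, k)
    if cost_x < cost_y:
        return "X"
    if cost_y < cost_x:
        return "Y"
    return "tie"


# Extra steps for series A and B on each tree, as in the paragraph above.
TREE_X = [0, 6]   # A: no extra steps, B: six extra steps
TREE_Y = [1, 1]   # A and B: one extra step each

for k in (1.0, 1.4, 1.6, 3.0):
    print(k, preferred_tree(TREE_X, TREE_Y, k))
```

Under this assumed cost, the preference between the two trees switches at k = 1.5, and for k < 1 tree X is preferred however many extra steps series B carries, because the cost of a single series can never exceed 1.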

With a larger concavity constant (k > 30, perhaps), characters with additional 
steps are scarcely down-weighted; implied weighting under large values of k approx- 
imates equal weighting. An intermediate value is therefore most suitable, but there 
is no objective means to select this value. Values of k between 3 and 5 are typical,
although the most appropriate value varies with the number of terminals (and thus
opportunity for homoplasy) in a data set. One way to approach this issue is to repeat
an analysis over a range of values of k, and to identify the strict consensus of
these possible trees. Rather than select a narrow range of values, we took 99 values
of k from a log-normal distribution, with mean = 5 and s.d. = 5, so as to exhaustively
sample parameter space. The strict consensus of most parsimonious trees for each
value of k, generated using the parsimony ratchet and sectorial search heuristics
in TNT, is reported in the Supplementary Data. Figure 2 displays the consensus
of all most parsimonious trees recovered at all sampled values of k (0.12-210) and 
the equal weights tree. 
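
As an illustration of the sampling step, the sketch below draws 99 concavity constants from a log-normal distribution. It is not the authors' script. It assumes that "mean = 5 and s.d. = 5" describe the log-normal variable itself rather than its underlying normal distribution, which the text does not specify, and the function names and seed are hypothetical.

```python
import math
import random


def lognormal_params(mean, sd):
    """Convert the mean and s.d. of a log-normal variable to the
    (mu, sigma) of its underlying normal distribution."""
    sigma_sq = math.log(1.0 + (sd / mean) ** 2)
    mu = math.log(mean) - sigma_sq / 2.0
    return mu, math.sqrt(sigma_sq)


def sample_concavity_constants(n=99, mean=5.0, sd=5.0, seed=0):
    """Draw n positive values of k, sorted from smallest to largest."""
    rng = random.Random(seed)  # fixed seed so the draw is repeatable
    mu, sigma = lognormal_params(mean, sd)
    return sorted(rng.lognormvariate(mu, sigma) for _ in range(n))


k_values = sample_concavity_constants()
print(len(k_values), min(k_values), max(k_values))
```

Each sampled k would then be passed to a separate implied-weights search, with the strict consensus taken across the resulting trees.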

Removal of transformation series. If transformation series 39 (‘sclerites consist
of a stack of constituent elements’) is excluded from the analysis, an identical topology
is recovered for all values of k, demonstrating that homology between the stacked
elements of onychophoran claws and the claws of Hallucigenia sparsa is supported 
even if the character is not reflected in the input matrix. 
Extended Data Figure 1 | Claw measurements. To reconstruct the
relationship between the stacked constituent elements, a digital image of a 
sclerite (a) was duplicated, rotated and enlarged such that its outer sclerite 
precisely overlay the inner sclerite in the original image (b; the cyan image has 
been enlarged by 5% and rotated to match the inner sclerite in the yellow 
image). Repetition of this process demonstrates a logarithmic growth trajectory 
(c); a logarithmic spiral was fitted to this trajectory and its Raupian parameters
calculated. This process was most accurate in the inner jaw elements, whose 
dentate margin provided multiple landmarks that allowed the precise fitting of 
subsequent images. Estimates were also obtained for the outer jaws and
appendicular claws of Euperipatoides, and the claws and spines of H. sparsa.
Hallucigenia spines demonstrated variability in Raup’s D because they are 
sometimes obliquely preserved, so the maximum value was taken as 
representative. The implied growth rate of 2.4 ± 2.7% in Euperipatoides
sclerites (range 0-8%; measured from five inner and six outer jaw sclerites) 
cannot persist throughout the organism’s lifespan, since moulting consistently 
occurs every 2 weeks (ref. 38). Either moulting occurs less frequently in wild 
populations, the rate of growth varies during ontogeny or the constituent 
elements deform slightly during growth. 
Extended Data Figure 2 | Density of scaly ornament in a hallucigeniid spine
with three constituent elements (Geological Survey of Canada 136958). 

a, Complete spine, showing regions of enlargement; b, apex of spine showing 
tips of two internal elements; c, margins of two internal elements faintly 
visible (dotted lines); density of scales where three elements are superimposed is 
0.050 scales per square micrometre; where two elements are superimposed it is 
0.039 scales per square micrometre; for a single element it is 0.026 scales per 
square micrometre; slight deviation from a 3:2:1 ratio is attributed to decreased
visibility of individual scales in occluded regions; d, up to five scales overlap; 
only two could overlap if scales were restricted to the front and back surfaces 
of a single element. Transmitted light images from multiple focal planes 
combined using CombineZM (A. Hadley). Scale bar, a, 100 μm; b, 40 μm;
c, 10 μm; d, 5 μm.
Extended Data Figure 3 | Claws of Euperipatoides kanangrensis
(Onychophora). a, b, Secondary electron images of a single claw, separated
into outermost element (a) and inner elements (b), each with ornamented basal
region. c, d, Differential image contrast images of a single claw, separated
into outermost element (c) and inner elements (d). Nomarski interference
contrast accentuates the basal ornament. e-g, Single claw, separated into
innermost element (e) and outer elements (f); pigmented foot tissue only
associated with inner two elements; g, digital superposition of e and f showing
original claw construction. h, Abnormal claw with blunt tip reflected in each
constituent element. Transmitted light images from multiple focal planes
combined using TuFuse (M. Lyons). Scale bar, 100 μm.
Extended Data Figure 4 | Sclerite constitution in other taxa. a, b, Single
constituent element in claws of (a) Nephrotoma spp. (Tipulidae, Hexapoda, 
Euarthropoda) and (b) Eutardigrada (species indeterminate). Nomarski 
interference contrast. c—e, Small carbonaceous fossils with stacked constituent 
elements, interpreted as appendicular sclerites of total-group onychophorans 
(images courtesy of T. Harvey): c, d, from the basal mid-Cambrian (Stage 5) 
Kaili biota; e, articulated pair from the mid- to late Cambrian Deadwood
Formation, each claw comprising two constituent elements. f, Three
appendicular sclerites (claws: arrowed) from a single appendage of Aysheaia 
pedunculata from the mid-Cambrian Burgess Shale (ROM 63052), each 
comprising a single element. Transmitted light images from multiple focal 
planes combined using TuFuse (M. Lyons) and CombineZM (A. Hadley). 
Scale bars, 100 μm.
OSCA1 mediates osmotic-stress-evoked Ca²⁺
increases vital for osmosensing in Arabidopsis


Fang Yuan, Huimin Yang, Yan Xue, Dongdong Kong, Rui Ye, Chijun Li, Jingyuan Zhang, Lynn Theprungsirikul,
Tayler Shrift, Bryan Krichilsky, Douglas M. Johnson, Gary B. Swift, Yikun He, James N. Siedow & Zhen-Ming Pei


Water is crucial to plant growth and development. Environmental
water deficiency triggers an osmotic stress signalling cascade, which
induces short-term cellular responses to reduce water loss and long-term
responses to remodel the transcriptional network and physiological
and developmental processes. Several signalling components
that have been identified by extensive genetic screens for altered sensitivities
to osmotic stress seem to function downstream of the perception
of osmotic stress. It is known that hyperosmolality and various
other stimuli trigger increases in cytosolic free calcium concentration
([Ca²⁺]ᵢ). Considering that in bacteria and animals osmosensing
Ca²⁺ channels serve as osmosensors, hyperosmolality-induced
[Ca²⁺]ᵢ increases have been widely speculated to be involved in osmosensing
in plants. However, the molecular nature of the corresponding
Ca²⁺ channels remains unclear. Here we describe a hyperosmolality-gated
calcium-permeable channel and its function in osmosensing in
plants. Using calcium-imaging-based unbiased forward genetic screens
we isolated Arabidopsis mutants that exhibit low hyperosmolality-induced
[Ca²⁺]ᵢ increases. These mutants were rescreened for their
cellular, physiological and developmental responses to osmotic stress,
and those with clear combined phenotypes were selected for further
physical mapping. One of the mutants, reduced hyperosmolality-induced
[Ca²⁺]ᵢ increase 1 (osca1), displays impaired osmotic Ca²⁺
signalling in guard cells and root cells, and attenuated water transpiration
regulation and root growth in response to osmotic stress.
OSCA1 is identified as a previously unknown plasma membrane protein
and forms hyperosmolality-gated calcium-permeable channels,
revealing that OSCA1 may be an osmosensor. OSCA1 represents a
channel responsible for [Ca²⁺]ᵢ increases induced by a stimulus in
plants, opening up new avenues for studying Ca²⁺ machineries for
other stimuli and providing potential molecular genetic targets for
engineering drought-resistant crops.

The lack of information regarding the molecular nature of Ca²⁺
channels responsible for increases in [Ca²⁺]ᵢ induced by various stimuli
prompted us to design forward genetic screens to identify these sensory
channels. With the assumption that osmosensing is the initial signalling
event, the design was devised on the basis that the hyperosmolality-induced
[Ca²⁺]ᵢ increase (OICI) is the earliest detectable event (~5 s) upon hyperosmolality
treatment (Extended Data Fig. 1a and Supplementary Information).
Note that in contrast to traditional genetic screens, in which the
phenotypes scored can take hours or days to reach a steady state, the entire
transient OICI event lasts only ~5 min. The variation of the OICI could
result in enormous numbers of false positives and make these screens difficult
to apply, possibly explaining why for over 20 years the phenomena
of stimulus-triggered [Ca²⁺]ᵢ increases have only recently been dissected
genetically and the corresponding Ca²⁺ channels still remain unknown.

To optimize screening conditions for mutants with reduced OICI, we
grew seedlings from ethyl methane sulphonate-mutagenized aequorin-expressing
Arabidopsis M2 seeds, treated these seedlings with several concentrations of
sorbitol, and analysed aequorin luminescence for each seedling (Extended
Data Fig. 1b). At 600 mM sorbitol, few seedlings had relative intensities
lower than an arbitrary threshold, a noise level that was practical for use
in genetic screens. We screened 85,600 M2 seeds, selected seedlings with
low OICI signals (Extended Data Fig. 1c), tested these seedlings individually
for four generations, and isolated 23 putative mutants with reduced
OICI. Then, several criteria were used to rank these mutants for further
physical mapping: there was no mutation in aequorin; the morphology
of mutant plants was similar to that of wild type throughout developmental
stages; the root growth response to osmotic stress was compromised;
the stomatal response to osmotic stress was affected; and finally
the phenotype of reduced OICI could be verified at the cellular level by
using another Ca²⁺ indicator. After these rescreens and verifications,
we named the most affected mutant reduced hyperosmolality-induced
[Ca²⁺]ᵢ increase 1 (osca1), and describe it here.

The basal [Ca²⁺]ᵢ was similar in wild-type and osca1 plants, whereas under
sorbitol treatment [Ca²⁺]ᵢ was much lower in osca1 (Fig. 1a, b; P < 0.001).
The reduced OICI was not due to a lesser amount of total aequorin (Extended
Data Fig. 2a, b). We analysed the kinetics of the OICI using aequorin
luminometry and observed that amplitudes were reduced in osca1
mutants (Fig. 1c). Then, we determined the dose dependence of the OICI
(Fig. 1d). A Hill curve could be fitted to the data with an apparent dissociation
constant (Kd) of 698 ± 23 mM and 981 ± 69 mM for wild type
and osca1, respectively. Hill coefficients were 3.8 and 2.0 for wild type
and osca1, respectively. Seedlings were treated with solutions containing
mannitol, sucrose, ribose or N-methyl-D-glucamine, and reduced OICIs
were also recorded in osca1 (Extended Data Fig. 2c). In addition, osca1
roots were slightly less sensitive to sorbitol treatment (Extended Data
Fig. 2d). The apparent Kd values of wild-type and osca1 plants were 382 ± 18 mM
and 411 ± 21 mM, respectively. Furthermore, to determine if OSCA1
is specific to hyperosmolality over other stimuli, we analysed [Ca²⁺]ᵢ elevation
in response to H₂O₂, a well-documented inducer of increased
[Ca²⁺]ᵢ (refs 4, 6, 14), and observed no difference between wild type and
osca1 (Fig. 1e). These results demonstrated that [Ca²⁺]ᵢ increases induced
specifically by hyperosmolality are impaired in osca1, and that OSCA1
might be a major component of the OICI.
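
To make the Hill fits reported above concrete, the sketch below evaluates a Hill curve using the leaf parameters quoted in the text (Kd of 698 mM and 981 mM, Hill coefficients of 3.8 and 2.0). It assumes the standard Hill form and normalizes each genotype to its own maximum response, so it shows only the shape of the dose dependence, not the measured amplitudes; the function name is illustrative.

```python
def hill_response(concentration_mM, kd_mM, hill_coefficient, max_response=1.0):
    """Standard Hill equation: R = Rmax * c^n / (Kd^n + c^n).

    Written as Rmax / (1 + (Kd / c)^n) for numerical convenience.
    """
    if concentration_mM <= 0:
        return 0.0
    ratio = (kd_mM / concentration_mM) ** hill_coefficient
    return max_response / (1.0 + ratio)


# Apparent Kd and Hill coefficient for the OICI in leaves, from the text.
WILD_TYPE = {"kd_mM": 698.0, "hill_coefficient": 3.8}
OSCA1_MUTANT = {"kd_mM": 981.0, "hill_coefficient": 2.0}

for sorbitol_mM in (200, 400, 600, 800, 1000):
    wt = hill_response(sorbitol_mM, **WILD_TYPE)
    mutant = hill_response(sorbitol_mM, **OSCA1_MUTANT)
    print(sorbitol_mM, round(wt, 3), round(mutant, 3))
```

By construction the response is half-maximal when the sorbitol concentration equals Kd, and a larger Hill coefficient gives a steeper rise around that point.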

To rule out the possibility that the low OICI at the whole-plant level
was caused by the inefficient detection of [Ca²⁺]ᵢ by aequorin, we used
another Ca²⁺ indicator, yellow Cameleon 3.6, that delivers a higher temporospatial
resolution. Note that we adopted several methods for
analysing abscisic acid (ABA)-induced [Ca²⁺]ᵢ increases in guard cells
and stomatal closure. Addition of sorbitol induced [Ca²⁺]ᵢ increases
in both wild-type and osca1 guard cells; however, the amplitudes were
significantly lower in osca1 (Fig. 2a–c). Similar OICI defects were seen
in osca1 root cells (Extended Data Fig. 2e–g).

For a given sensor, after the conversion of the external signal into a 
secondary messenger, the signal should be funnelled on to downstream 
processes. We assessed whether OSCA1 is required for cellular processes 
that are known to be regulated by osmotic stress; that is, processes down- 
stream of the sensor. Stomatal pores formed by pairs of guard cells are 
Figure 1 | Isolation of a genetic mutant with reduced hyperosmolality-induced
[Ca²⁺]ᵢ increases (OICI) in Arabidopsis. a, OICIs in wild-type (WT)
and osca1 plants. Plants expressing aequorin were treated with 0 or 600 mM
sorbitol, and [Ca²⁺]ᵢ was analysed by imaging aequorin. [Ca²⁺]ᵢ is scaled by
a pseudo-colour bar. b, Quantification of OICI in leaves from experiments
similar to those in a. Data for three representative experiments are shown
(mean ± standard error of the mean (s.e.m.); n = 30). c, Time-course
analysis of OICI. Plants grown individually in a 96-well plate were treated with
600 mM sorbitol, and luminescence was recorded at intervals of 1 s. Data for
16 seedlings are shown (mean ± s.e.m.; two-way analysis of variance
(ANOVA), P < 0.001). d, Increases in [Ca²⁺]ᵢ plotted as a function of applied
sorbitol concentrations. Data for three separate experiments are shown
(mean ± standard deviation (s.d.); n = 30) and fitted to the Hill equation.
e, Increases in [Ca²⁺]ᵢ plotted as a function of applied H₂O₂ concentrations
(mean ± s.d.; n = 30; two-way ANOVA, P > 0.3).


the gateways for water loss and CO₂ uptake, and open and close in response
to water availability. The addition of 200 mM sorbitol caused stomatal
closure in wild-type plants but this was much reduced in osca1 plants
(Fig. 2d, e). Through analysing the steady-state responses of stomatal
apertures to sorbitol, we confirmed that the osca1 mutant was less sensitive
than wild-type plants (Fig. 2f). The dose-dependence data were fitted
to the Hill equation with a Kd of 102 ± 2 mM and 323 ± 39 mM for wild-type
and osca1 plants, respectively. Note that the high apparent Kd values
for OICIs observed in leaves might result from the slow penetration of
solution through stomatal pores (Supplementary Information). Osmotic
stress induces the accumulation of ABA, which triggers stomatal closure.
Nevertheless, ABA-induced stomatal closure was unaffected in osca1 (Extended
Data Fig. 3a), suggesting that OSCA1 may act upstream of ABA.
These data indicate that OSCA1 may have a role in the calcium-mediated
osmotic signalling in guard cells.

To understand whether OSCA1 has a key role in response to osmo- 
tic stress at the whole-plant level, we directly monitored plant wilting 
under osmotic stress. We treated wild-type and osca1 plants with 20% 
Figure 2 | Impaired guard cell osmotic stress signalling and attenuated
plant responses to osmotic stress in osca1 plants. a, Emission images
(F535 nm and F485 nm) of epidermal strips from plants expressing YC3.6 were
taken every 3 s, and ratiometric images (F535:F485) before and 20 s after
addition of 200 mM sorbitol are shown. b, The ratios were quantified from
guard cell pairs in a (n = 5). c, Peak ratio changes from experiments similar to
a and b are shown (boxes represent the standard error (s.e.), error bars are s.d.;
n = 36; P < 0.001). d, Light images of epidermal strips were taken at varied
intervals, and guard cell images before and 30 min after addition of 200 mM
sorbitol are shown. e, Changes in the width of stomatal pores in response to
sorbitol from the same epidermal strips in d (mean ± s.e.m.; n = 5 stomata;
two-way ANOVA, P < 0.001; r.u., relative units). f, Stomatal apertures are
plotted as a function of applied sorbitol concentrations (mean ± s.e.m.; n = 80
for 150 mM and 200 mM, and 60 for others; two-way ANOVA, P < 0.001). The
apertures of wild type and osca1 before the sorbitol treatment were 3.19 μm
and 3.05 μm, respectively, and arbitrarily set to 1. g, Wild-type and osca1 plants
grown side-by-side in the same pot were treated with 20% PEG. Plant
photos taken at time 0 and 30 min are shown. h, Leaves are numbered, and
representative leaf no. 1 from g are shown. i, Leaf area reduction was quantified
from experiments similar to those in g. Data from six independent experiments
are shown (boxes represent the s.e., error bars are s.d.; n = 30; P < 0.001).
j, PEG-induced stomatal closure from experiments similar to those in g was
quantified. Stomatal apertures (width) at time 0 were 2.61 μm and 2.99 μm for
wild type and osca1, respectively, and arbitrarily set to 1. Data from three
independent experiments are shown (mean ± s.d.; n = 60; two-way ANOVA,
P < 0.01). k, l, Plants were grown in half-strength Murashige and Skoog
medium (½ MS) in the presence or absence of sorbitol for 10 days (k) and root
length was quantified (l). Data are from ten independent experiments
(mean ± s.e.m.; n = 60; two-way ANOVA, P < 0.001).


polyethylene glycol (PEG). The leaf areas were reduced much more in
osca1 than in wild-type plants (Fig. 2g–i), consistent with the observations
that stomata were more open in osca1 over the period of PEG
treatment (Fig. 2j) and that detached osca1 leaves lost water more rapidly
than wild type (Extended Data Fig. 3b).

We then examined long-term developmental responses to osmotic 
stress. Without osmotic stress wild-type and osca1 seedlings had similar 
root lengths, while sorbitol treatment inhibited root growth more in 
osca1 (Fig. 2k, l). Nevertheless, root growth in response to ABA was not
affected in osca1 (Extended Data Fig. 3c). These analyses reveal that osca1
displays defects in major aspects of the osmotic stress signalling pathway 
as well as whole-plant responses to osmotic stress, indicating a defect 
in the perception of hyperosmolality. 

Note that the osca1 plants could not be distinguished from wild type 
throughout developmental stages, and that the phenotypes other than the 
reduced OICI were either impractical to map or were not robust enough 
to allow mapping of the osca1 mutation, explaining why osca1 mutants 
have not been isolated by previous forward genetic screens. Genetic analysis
showed that the osca1 phenotype was caused by a recessive mutation
in a single nuclear gene (Extended Data Fig. 4a). We attempted to
prepare a mapping population by crossing osca1 (Col-0) as well as other
mutants with reduced OICI to the most commonly used ecotype, Landsberg
erecta. Unfortunately, it was not feasible to phenotype the F₂ and
F₃ populations, possibly because variations introduced by the crosses
between the two diverged ecotypes impaired the recognition of mutants
having a relatively subtle phenotype. We tested several other commonly
used ecotypes, and found that Wassilewskija (Ws) was the best (Extended
Data Fig. 4a, b). We used ~12,600 F₂ seeds from the osca1 × Ws cross,
phenotyped their F₃ seedlings, and obtained 628 mapping lines. Note that
the disadvantage of using Ws was the lack of a whole-genome sequence 
at the time, and we had to develop DNA markers based on available single 
nucleotide polymorphisms (SNPs) (Extended Data Fig. 4c). 

Through fine mapping, OSCA1 was identified as a novel gene encod- 
ing a protein of 772 amino acid residues (At4g04340; Extended Data 
Fig. 4d). Two nucleotide mutations were found in osca1, which resulted 
in mutations of glycine 59 to arginine (G59R) and glycine 507 to aspartic 
acid (G507D). Hydrophobicity analyses predicted OSCA1 as an integral 
protein with nine transmembrane α-helices (Fig. 3a and Extended Data
Fig. 5). The region between transmembrane helices 8 and 9 could be 
another transmembrane helix, or a re-entrant pore loop. To verify whether 
Figure 3 | OSCA1 encodes a novel integral protein in the plasma
membrane. a, The predicted membrane topology and protein structure of
OSCA1. Transmembrane domains (TM), the pore domain and the two
mutations in osca1 are indicated. b, Complementation of the osca1 phenotype
by overexpression of OSCA1 (OSCA1 osca1-1). osca1-2, a T-DNA insertion
line. c, Quantitative analysis of OICI in leaves from experiments as in
b (mean ± s.e.m.; n = 30 seedlings). d–f, Expression patterns of OSCA1
promoter (pOSCA1)::GUS in leaves (d), guard cells (e) and roots (f).
g, h, Plasma membrane localization of OSCA1 in seedlings expressing the
OSCA1 promoter-driven OSCA1-GFP construct. GFP fluorescence was
observed in the periphery of the turgid cells (g) and plasmolysed cells (h).
OSCA1 is responsible for these phenotypes, we found that a T-DNA-insertion-mutagenized
line, osca1-2, had a reduced OICI phenotype
similar to osca1 (Fig. 3b, c and Extended Data Fig. 6a–c). Note that osca1
is osca1-1 in this study. Additionally, overexpression of OSCA1 could
complement the osca1 phenotype.

To understand the molecular mechanisms and physiological functions
of OSCA1 in plants, we determined the expression patterns and subcellular
localization of OSCA1. Analysis of OSCA1 promoter::β-glucuronidase
(GUS) transgenic plants showed that OSCA1 was expressed in leaves, flowers,
roots and guard cells (Fig. 3d–f and Extended Data Fig. 6d–g).
Similar patterns were seen in whole-plant GFP images of OSCA1 promoter::OSCA1-GFP
transgenic plants (Extended Data Fig. 6h, i), consistent
with the reduced OICI phenotypes seen in leaves and roots, as
well as in guard cells. OSCA1-GFP was exclusively localized to the vicinity
of the cell surface in turgid cells (Fig. 3g and Extended Data Fig. 6j) as
well as plasmolysed cells (Fig. 3h), whereas GFP alone was localized throughout
the cells (Extended Data Fig. 6k). The plasma membrane localization
is consistent with the prediction by the subcellular location database
for Arabidopsis proteins, and supported by studies on plasma membrane
proteomes.

To determine if OSCA1 can directly mediate Ca²⁺ influx, we expressed
OSCA1 in human embryonic kidney 293 (HEK293) cells, and analysed
its activity using Fura-2-based Ca²⁺ imaging. We postulated that increasing
the Ca²⁺ concentration outside the cell might cause [Ca²⁺]ᵢ elevation
in cells overexpressing calcium-permeable channels even in the absence
of the appropriate gating components. The elevation of external Ca²⁺
from 0.1 mM to 2.5 mM induced much larger [Ca²⁺]ᵢ increases in cells
expressing OSCA1 than in those expressing an empty vector, pcDNA3.2,
or mutant OSCA1 (mOSCA1), which contains the two mutations identified
in osca1 plants (OSCA1(G59R/G507D); Extended
Data Fig. 7a–e). We then determined if OSCA1 could mediate OICIs.
Addition of sorbitol triggered larger [Ca²⁺]ᵢ increases in cells expressing
OSCA1 than in cells harbouring an empty vector (Extended Data Fig. 7f, g)
or mOSCA1 (Fig. 4a–d). We determined the subcellular localization of
OSCA1 and observed that only OSCA1-GFP was localized in the vicinity
of the plasma membrane (Fig. 4e and Extended Data Fig. 8a, b). We
employed the widely used Mn²⁺ quenching of Fura-2 fluorescence to
monitor Ca²⁺ entry into the cell. The addition of Mn²⁺ resulted in a pronounced
quenching of Fura-2 fluorescence in OSCA1-expressing cells,
but to a much lesser extent in cells expressing an empty vector or mOSCA1
(Extended Data Fig. 8c). Together, these data demonstrate that expression
of OSCA1 promotes Ca²⁺ influx across the plasma membrane in
response to Ca²⁺ and hyperosmolality.

To determine if OSCA1 functions as a calcium-permeable channel
gated by hyperosmolality, we carried out a series of electrophysiological
experiments at the whole-cell and single-channel levels in HEK293 cells.
The OSCA1-transfected cells showed larger currents in response to sorbitol
treatment than GFP- or mOSCA1-transfected cells (Fig. 4f–i). The
currents were enhanced at both positive and negative membrane potentials
in a dose-dependent manner (Fig. 4g, j). A Hill curve could be fitted
to the data with a Kd of 312 ± 12 mM and a Hill coefficient of 4.3. The
similar Hill coefficients of ~4 obtained in plants and in vitro may be associated
with tetrameric structures commonly seen for ion channels.

Given that the hyperosmolality-gated OSCA1-mediated currents decayed 
very fast, and that OSCA1-mediated currents were larger than the control
under iso-osmotic conditions, as well as calcium-induced [Ca²⁺]ᵢ
increases in OSCA1-expressing cells, we characterized the electrophys- 
iological properties of OSCA1 under iso-osmotic conditions with the 
expectation that the residual currents could represent largely those acti- 
vated by hyperosmolality. The OSCA1-mediated currents appeared 
instantaneous, and showed a weak outward rectification (Fig. 4k and 
Extended Data Fig. 8d). We recorded OSCA1-mediated single-channel 
currents using outside-out membrane patches (Fig. 41), these currents 
were not found in control cells or mOSCA1-expressing cells (n > 25 
patches). The current-voltage relation gave rise to a conductance of 
49.2 + 5.5 pS (Extended Data Fig. 8e; n = 5). To determine the ionic 

Figure 4 | OSCA1 forms hyperosmolality-gated calcium-permeable
channels in HEK293 cells. a, Increases in [Ca²⁺]i in response to 650 mM
sorbitol in HEK293 cells expressing OSCA1, or mutant OSCA1
(OSCA1(G59R/G507D) (mOSCA1)). [Ca²⁺]i increases were analysed by
Fura-2 emission ratios (F340 nm:F380 nm) and scaled using a pseudo-colour
bar. b, c, Dynamic analyses of OICI in cells expressing OSCA1 (b) or mOSCA1
(c) from experiments as in a. Data are mean ± s.d. (n = 60); Sor indicates point
of sorbitol administration. d, Quantitative analyses of the OICI peaks from
experiments as in b, c and Extended Data Fig. 7g. Data for three separate
experiments are shown (mean ± s.e.m.). +Sor and −Sor, with and without
the sorbitol treatment, respectively. pcDNA3.2, empty vector. e, Plasma
membrane localization of OSCA1 in cells expressing OSCA1-GFP construct


selectivity of OSCA1 channels, we substituted cationic compositions
in the bath and recorded currents (Extended Data Fig. 8f). The OSCA1
channel did not discriminate between monovalent and divalent cations,
and had a slight preference for K⁺, showing the following permeability
sequence: K⁺ > Ba²⁺ ≈ Ca²⁺ > Na⁺ = Mg²⁺ = Cs⁺ (Extended Data
Fig. 8g). Our data show that OSCA1 is a hyperosmolality-gated
non-selective cation channel that permeates Ca²⁺ ions.

It is well established that various abiotic and biotic stimuli trigger
[Ca²⁺]i increases by activating Ca²⁺ channels in plants®??-*4. OSCA1
represents, to our knowledge, the first example of such a channel that
has been identified genetically. OSCA1 was involved in osmotic-stress-induced
fast signalling events, intermediate cellular processes and prolonged
growth and development responses, and was also activated by
hyperosmolality, similar to the osmosensor TRPV4 (ref. 25), revealing
OSCA1 to be an osmosensor. Life largely involves aqueous chemistry as
most cells consist of over 80% water~’. A change in osmolality generates
a stretch force on the plasma membrane, which activates osmosensors.
Thus, osmosensors are known to be a subtype of mechanosensing channels
such as DEG/ENaC, TRP, K2P, MscS-like and Piezo in non-plant
eukaryotes”*?!”°. No DEG/ENaC or TRP exist in plants'®'’. Although
there are ten MscS-like homologues and one Piezo homologue in Arabidopsis’, whether
they function as osmosensors remains to be determined. The Arabidopsis
MID1-complementing activity 1 and 2 (MCA1 and MCA2) proteins,
which display homology to a yeast stretch-activated Ca²⁺ channel MID1,
mediate hypo-osmolality-induced [Ca²⁺]i increases and mechanical
responses, but they are not pore-forming subunits’’. Arabidopsis thaliana
histidine kinase HK1 may function similarly to the yeast osmosensor
histidine kinase SLN1 (ref. 28). Therefore, it would be interesting to study
whether and how OSCA1 works together with those sensors to monitor
water availability in plants.

OSCA1 belongs to a gene family with 15 members in Arabidopsis, and
homologues are found in other plant species and throughout eukaryotes

with GFP construct as a control. f–h, Whole-cell currents recorded during
rapid voltage ramps (+124 mV to −116 mV) in cells expressing empty vector
(f), OSCA1 (g) or mOSCA1 (h). Currents were recorded in the standard
bath solution (black), and then with 300 mM sorbitol (green). Currents were
recorded every 10 s with the largest currents shown. i, Averaged currents at
−56 mV from experiments similar to those in f–h (mean ± s.e.m.; n = 23
(Control), 22 (OSCA1), 8 (mOSCA1)). j, Currents plotted as a function of
applied sorbitol concentrations (mean ± s.e.m.; n = 3 (75 mM, 150 mM,
225 mM), 5 (600 mM, 750 mM), 22 (300 mM, 375 mM, 450 mM)) and fitted
to the Hill equation. k, Whole-cell currents in cells expressing OSCA1 or the
GFP vector as a control. l, Single-channel currents recorded in the outside-out
patch from OSCA1-expressing cells.


(Extended Data Figs 9 and 10). The yeast homologue RSN1 is a plasma
membrane protein with unknown function (Supplementary
Information)”. The founding member (OSCA3.1) of the family encoded early
responsive to dehydration 4 protein (ERD4)*°. Nonetheless, we found
that ERD4 knockout mutants displayed wild-type OICIs, suggesting
that ERD4 may differ from OSCA1, reminiscent of the diverse functions
of TRPs in animals”*. Identification of OSCA1 not only opens up a new
avenue for studying osmosensing, but also sheds light on the molecular
nature of Ca²⁺ channels responsible for other stimuli, and may
provide potential molecular genetic targets for engineering crops resistant
to drought.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 

METHODS


Plant material and growth conditions. Arabidopsis thaliana ecotype Col-0
constitutively expressing the intracellular Ca²⁺ indicator aequorin (pMAQ2; a gift from
M. Knight)*' or constitutively expressing cameleon (YC3.6; a gift from S. Gilroy)’°
were used. The T-DNA insertion lines SAIL_607_F09 (osca1-2), WiscDsLox331H10
(osca1-3), SALK_004685 (erd4) and SALK_078537.53.75.X (erd4)*°” were obtained
from the Arabidopsis Biological Resource Center (ABRC). Plants were grown in soil
(Scotts Metro-Mix 200), or in Petri dishes in half-strength Murashige and Skoog salts
(1/2 MS; Sigma), 1.5% (w/v) sucrose (Sigma), and 0.8% (w/v) agar (Becton Dickinson)
in controlled environmental rooms at 21 ± 2 °C. The fluence rate of white light was
~110 µmol m⁻² s⁻¹. The photoperiods were 16 h light/8 h dark cycles. Seeds were
sown on soil/MS media, placed at 4 °C for 4 days in the dark, and then transferred
to growth rooms.

Aequorin bioluminescence-based Ca²⁺ imaging. [Ca²⁺]i was measured using
Arabidopsis plants expressing aequorin as described previously***. Seedlings were
treated evenly with 3.3 ml of 10 µM coelenterazine (Prolume) per 150 mm × 15 mm
Petri dish 12 h before imaging and placed in the dark. Aequorin bioluminescence
imaging was performed using a ChemiPro HT system (Roper Scientific) equipped
with a light-tight box, a cryogenically cooled and back-illuminated CCD camera and
a liquid nitrogen autofiller. The camera was controlled by WinView/32 (Roper) and
bioluminescence images were analysed using MetaMorph 6.3 (Molecular Devices).
The recording of luminescence (L) was started 30 s before treatment and lasted for
5 min. Bright-field images were taken after aequorin imaging. The total aequorin
luminescence (Lmax) was estimated by discharging with 0.9 M CaCl₂ in 10% (v/v)
ethanol*!*. The calibration of [Ca²⁺]i measurements was adopted, with modification,
from the equation described previously**: pCa = 0.332588 × (−log k) + 5.5593, where k is a rate
constant equal to luminescence counts (L) divided by total remaining counts (Lmax).
Considering that the previous equation was designed for aequorin
luminometry spectroscopy, we used the measurement from a microplate
luminescence reader (see below for details) to calibrate the ChemiPro HT system. We
treated plants with 0 to 1 M sorbitol and obtained L/Lmax values of leaves or roots.
Then, we fitted these data to the previous equation pCa = a × (−log (L/Lmax)) + b, and
obtained the equation pCa = 0.6747 × (−log k) + 5.3177. Note that the calculated
[Ca²⁺]i values presented in the current study are similar to those reported previously***.
Data for dose–response curves were fitted to the Hill equation: [Ca²⁺]i = [Ca²⁺]i,max
[sorbitol]ⁿ/(Kd + [sorbitol]ⁿ), where [Ca²⁺]i,max is the maximum possible [Ca²⁺]i
change; [sorbitol], the applied sorbitol concentration; Kd, the apparent dissociation
constant; n, the Hill coefficient. All the treatments were carried out in the dark, and
the experiments were carried out at room temperature (22–24 °C).
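
The calibration described above can be sketched in a few lines. This is a minimal illustration, assuming that "log" in the calibration equation is the base-10 logarithm and that pCa = −log10 of the molar Ca²⁺ concentration; the function names and the example counts are hypothetical and are not taken from the paper.

```python
import math

# Coefficients of the refitted imaging calibration quoted in the text:
# pCa = 0.6747 * (-log k) + 5.3177, with k = L / Lmax.
IMAGING_SLOPE = 0.6747
IMAGING_INTERCEPT = 5.3177


def pca_from_luminescence(l_counts, l_max,
                          slope=IMAGING_SLOPE, intercept=IMAGING_INTERCEPT):
    """Convert luminescence counts to pCa (log assumed to be base 10)."""
    if l_counts <= 0 or l_max <= 0:
        raise ValueError("counts must be positive")
    k = l_counts / l_max
    return slope * (-math.log10(k)) + intercept


def calcium_molar(pca):
    """Convert pCa to a molar concentration, assuming pCa = -log10([Ca2+])."""
    return 10.0 ** (-pca)


# Hypothetical example: 500 counts in one frame, 2.0e6 total remaining counts.
pca = pca_from_luminescence(500.0, 2.0e6)
print(f"pCa = {pca:.3f}, [Ca2+]i = {calcium_molar(pca) * 1e9:.0f} nM")
```

Passing `slope=0.332588, intercept=5.5593` instead applies the luminometry equation quoted in the same paragraph.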

Screen for mutants with low OICI. Arabidopsis seeds expressing aequorin were
mutagenized with ethyl methane sulphonate (EMS) as described previously”. Briefly,
seeds (15 ml) were imbibed overnight and then shaken in 10 mM EMS for 15 h.
The M1 seeds were rinsed thoroughly with tap water, mixed in 0.1% agarose, and
planted in 40 flats (25.4 cm × 50.8 cm) at approximately 800 M1 seeds per flat. The
flats were placed at 4 °C for 4 days before transfer to a greenhouse, and mature M2
seeds were collected in pools (~400 seedlings per pool). For screens for mutants
with low hyperosmolality-induced [Ca²⁺]i increase (OICI), M2 seeds were sterilized,
and individual seeds were planted evenly using a template in 150 mm × 15 mm
Petri dishes, and grown for 9 days. Aequorin bioluminescence images were acquired
for the hyperosmolality treatment, that is, adding 600 mM sorbitol solution into
the Petri dish via a custom-built device. M2 seedlings that showed weaker
[Ca²⁺]i increases in leaves were picked. These seedlings were then transferred to
soil, and seeds were collected from each plant individually. From the second- to the fourth-round screens,
individual lines were checked for the reduced OICI phenotype, and lines with the
stable phenotype of low OICI were isolated as mutants with low OICI. To ensure
that the low OICI phenotype was not caused by potential defects in aequorin-based
calcium measurements, such as mutations in aequorin and the uptake of
coelenterazine into leaves, we sequenced the aequorin transgene in these putative mutants.
In addition, we speculated that the reduced OICI phenotype should be accompanied by cellular and
physiological phenotypes in these putative mutants. Therefore, we analysed known
osmotic stress-regulated cellular and physiological processes and ranked these
putative mutants based on these phenotypes for further mapping, with the expectation
that this would give a high probability of identifying key components in the osmotic
stress signalling pathway in plants.

Aequorin luminometry spectroscopy. Aequorin luminometry was carried out as
described previously**'**. Aequorin-expressing Arabidopsis seeds of wild-type and
osca1 mutants were placed individually in each well of 96-well plates containing
50 µl ½ MS medium, 1.5% (w/v) sucrose, and 0.8% (w/v) agar, and grown for 10 days.
Kinetic luminescence measurements were performed with an automated
microplate luminescence reader (Mithras LB 940, Berthold Technologies).
Luminescence counts were integrated every 1 s, and after automatic injection of 0.2 ml of
600 mM sorbitol solution into each well that took about 3 s, bioluminescence was
recorded for 60 s per well. Luminescence values were calibrated as [Ca²⁺]i using
the following equation (ref. 34): pCa = 0.332588 × (−log k) + 5.5593.

Cameleon-based [Ca²⁺]i imaging in guard cells and root cells. The osca1-1 mutant
was crossed into wild-type plants constitutively expressing GFP fluorescence
resonance energy transfer (FRET)-based Ca²⁺ sensor yellow Cameleon 3.6 (YC3.6)!®"**,
and five homozygous lines were generated. Cameleon-based [Ca²⁺]i measurements
in guard cells and root cells were conducted as described previously'*”’. Rosette leaf
epidermal peels from 2-week-old plants were placed in a microwell chamber in the
bath solution containing 100 µM CaCl₂, 5 mM KCl, 10 mM MES-Tris, pH 6.15 for
2.5 h under light (120 µmol m⁻² s⁻¹). Ratiometric Ca²⁺ imaging was performed
using a fluorescence microscope (Axiovert 200; Zeiss) equipped with two filter wheels
(Lambda 10-2; Sutter Instruments), and a cooled CCD camera (CoolSNAPfx; Roper
Scientific). Excitation was provided at 440 nm, and emission ratiometric (F535 nm:
F485 nm) images were collected using MetaFluor software. Hyperosmolality
solutions were prepared by adding sorbitol to the bath solution, and epidermal peels were
treated with these solutions at the indicated times. Similarly, 5-day-old roots were used
for YC3.6 imaging.

Stomatal aperture and density bioassays. The time course of stomatal response
to treatments was examined as previously described with slight modifications’®.
Rosette leaf epidermal peels were placed in a microwell chamber and incubated in
the opening solution containing 100 µM CaCl₂, 5 mM KCl, 10 mM MES-Tris, pH 6.15
for 2.5 h under light (120 µmol m⁻² s⁻¹) as described above for imaging [Ca²⁺]i
in guard cells. Light images of epidermal peels were taken using the Axiovert 200
microscope at ~2 min intervals, and the width of stomatal apertures was analysed
using ImageJ software (http://rsbweb.nih.gov/ij/index.html). Hyperosmolality
solutions were prepared and added to the bath as described above. For the steady-state
stomatal response to treatments of hyperosmolality and abscisic acid (ABA),
experiments were carried out as described previously'*”*”’. Detached rosette leaves of
Arabidopsis were floated in the opening solution for 2.5 h under light. The leaves
were transferred to the opening solution containing additional sorbitol or ABA at
indicated concentrations for 2 h under light. Light images of epidermal peels were
taken using the Axiovert 200 microscope, and the width of the stomatal pore was
analysed using ImageJ. Images of epidermal strips taken for the stomatal aperture bioassay
were reanalysed using ImageJ for stomatal densities, and no difference in stomatal
density between wild type and osca1 was observed.

Physiological analyses in osmotic stress responses. Polyethylene glycol (PEG)-based
osmotic stress treatments were adopted from the experimental procedure
described previously*°. Wild-type and osca1 plants were grown side-by-side in the
same pots with a hole in the bottom for 23 days, and the pots were submerged into
a solution containing 20% (w/v) PEG-6000 (average molecular mass 6000; Sigma).
Note that the rosette leaves did not contact the PEG solution. The 20% PEG
treatment causes a modest osmotic stress (−0.5 MPa)*°. Plants were photographed at
1 min intervals, and photographs at time 0 and 30 min are shown and were used for
further image analysis. Leaf areas for individual leaves were quantified using ImageJ,
and leaf-area reduction for each leaf was calculated. For stomatal aperture analysis
(Fig. 3d), seedlings were removed from pots that were submerged into the PEG
solution at the indicated time. Epidermal strips were prepared immediately, and
the width of stomatal apertures was analysed as described above. For leaf water loss
assays, fully expanded rosette leaves were detached from 3-week-old seedlings and
placed in the same growth conditions as described previously***’. Each sample,
consisting of five individual leaves, was weighed at the indicated time, and water loss was
calculated with respect to the initial weight.
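
The water-loss calculation is simple arithmetic; a minimal sketch follows, assuming loss is expressed as a percentage of the initial fresh weight of each five-leaf sample. The function name and the weights are hypothetical placeholders, not data from the paper.

```python
def water_loss_percent(initial_weight_g, later_weights_g):
    """Return water loss (% of initial fresh weight) at each time point."""
    if initial_weight_g <= 0:
        raise ValueError("initial weight must be positive")
    return [100.0 * (initial_weight_g - w) / initial_weight_g
            for w in later_weights_g]


# Hypothetical weights (g) of one five-leaf sample at successive time points.
initial = 0.50
weights = [0.50, 0.45, 0.40, 0.36]
for w, loss in zip(weights, water_loss_percent(initial, weights)):
    print(f"weight {w:.2f} g -> water loss {loss:.1f}%")
```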

Genetic analysis and physical mapping. We back-crossed mutants with low OICI
to aequorin-expressing Col-0 three times. The homozygous mutant lines in the
Col-0 background that showed a 1:3 mutant:wild-type ratio were crossed to the
ecotype Wassilewskija (Ws), and the F1 progeny were self-pollinated to yield an
F2 population. For osca1 mapping, seedlings from ~12,600 F2 seeds grown on Petri
dishes that showed kanamycin resistance (aequorin transgene) were transferred to
soil. Note that the mapping lines should be homozygous at both the aequorin and
osca1 loci. We genotyped aequorin using PCR, and aequorin-homozygous lines
were then harvested individually for F3 seeds. These F3 lines were analysed
individually for the reduced OICI phenotype using aequorin imaging. Eventually,
homozygous osca1 lines with homozygous aequorin were obtained as the mapping population.
Linkage analysis of F2 plants revealed that the osca1 locus is located on chromosome 4.
Since no whole-genome sequence of Ws was available at the time we carried out the
physical mapping, we downloaded the 250,000 single-nucleotide polymorphism
(SNP) data from the NSF 2010 Program
(http://1001genomes.org/data/MPI/MPIcollab2011/releases/2011_06_28/strains/Ws-2/TAIR8/), and used the SNP
information to design fine-mapping markers. Note that about 1 in 10 SNPs could
be verified on average by sequencing and used for marker design. These markers were
used to perform PCR and isolate the interval that flanks the mutation”. Finally, we
sequenced open reading frames (ORFs) from the narrowest interval and identified
mutations in osca1.
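
To illustrate how recombinant counts translate into map distances during fine mapping, the sketch below uses the counts printed in Extended Data Fig. 4b (recombinants out of 628 plants). It assumes one common convention, in which each diploid plant contributes two chromosomes, so that distance (cM) = 100 × recombinants / (2 × plants). This convention gives values close to those printed in the figure, but the paper does not state the formula used; the function name is hypothetical.

```python
def map_distance_cM(recombinants, plants):
    """Map distance in cM, counting two chromosomes per diploid plant."""
    if plants <= 0 or recombinants < 0:
        raise ValueError("plants must be positive, recombinants non-negative")
    return 100.0 * recombinants / (2 * plants)


# Recombinant counts per interval as printed in Extended Data Fig. 4b.
PLANTS = 628
for recombinants in (26, 3, 1, 2, 13):
    distance = map_distance_cM(recombinants, PLANTS)
    print(f"{recombinants:2d}/{PLANTS} recombinants -> {distance:.2f} cM")
```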

DNA constructs and transgenic lines. Gateway cloning*’ was used to construct
p35S::OSCA1, p35S::OSCA1-GFP, pOSCA1::GUS, pOSCA1::OSCA1-GFP,
pCMV::OSCA1 and pCMV::OSCA1-GFP. The OSCA1 full-length complementary DNA and
the 2 kb promoter region were amplified by PCR from cDNA and genomic DNA,
respectively. The cDNA fragment and the promoter region were cloned into the
pENTR vector (Invitrogen). Coding sequences were transferred from the entry clones
to Gateway-compatible destination vectors (Invitrogen). Transgenic Arabidopsis lines
were generated by Agrobacterium-mediated transformation“, and homozygous
transgenic T3 lines carrying a single insertion were used. The osca1-2 (SAIL_607_F09) and
osca1-3 (WiscDsLox331H10) lines were obtained from the ABRC. Homozygous lines
were selected and the OSCA1 transcript was analysed by reverse transcription
PCR (RT-PCR). The osca1-2 and osca1-3 mutants were crossed into the
aequorin-expressing wild type, and homozygous lines were generated. Note that, in the osca1-3
background, aequorin expression was silenced and several osca1-3 aequorin lines
identified could not be used to analyse the reduced OICI phenotype.

OSCA1 mRNA analysis. The abundance of OSCA1 mRNAs from wild-type and
osca1 seedlings was analysed by RT-PCR as described’. Total mRNAs were
prepared and reverse transcribed using a cDNA synthesis kit, and UBQ was used as a
loading control.

Histochemical GUS activity analysis. The histochemical staining for
β-glucuronidase (GUS) activity using the OSCA1-promoter-driven GUS (pOSCA1::GUS)
transgenic lines was performed as described”’. Seedlings grown in ½ MS media or
in soil were used for the histochemical staining**. Data represent six independent
lines examined, which displayed similar staining patterns. Similar results were seen
from OSCA1-promoter-driven OSCA1-GFP (pOSCA1::OSCA1-GFP) transgenic
lines.

OSCA1-GFP subcellular localization analysis. For analysis of OSCA1-GFP in
Arabidopsis seedlings, both OSCA1 promoter-driven pOSCA1::OSCA1-GFP and
35S-promoter-driven p35S::OSCA1-GFP transgenic plants were generated as described**“*.
The p35S::GFP transgenic plants were used as a control. Seedlings grown in ½ MS
media in Petri dishes for 7 days were subjected to confocal imaging with the Zeiss
LSM 710 microscope or whole-seedling imaging with a Zeiss SteREO Discovery
V20 microscope. Plasmolysis was induced by adding 0.8 M sorbitol. Data
represent more than 10 independent lines examined, which displayed similar GFP
subcellular localization. For analysis of OSCA1-GFP in HEK293 cells, cells were cultured
on poly-lysine-coated glass coverslips and transfected transiently with
pCMV::OSCA1-GFP as described above. About 18 to 24 h after transfection, coverslips were mounted
on glass slides and subjected immediately to GFP fluorescence imaging with the
Zeiss Axiovert 200 microscope, as well as confocal imaging with the Zeiss LSM 710
microscope. For confocal imaging a ×63 water immersion objective was used. The
plasma membrane localization is well supported by several studies on plasma
membrane proteomes'?”°*”.

Imaging of [Ca²⁺]i in HEK293 cells. Human embryonic kidney 293T (HEK293T) cells
were grown and maintained in DMEM medium supplemented with 10% fetal bovine
serum, 1% penicillin and streptomycin in a CO₂ incubator at 37 °C. For
transfection, cells were seeded onto poly-lysine-coated eight-well chambered coverglasses
(Nunc) overnight, and transfected with plasmid DNA using Lipofectamine 2000
reagent (Invitrogen) as described previously’”“*“”. Cells were loaded with the
Ca²⁺-sensitive dye Fura-2AM (5 µM; Sigma). A Fura-2-based Ca²⁺ imaging assay was
performed in the HEK293 cells 18 to 24 h after transfection using the Axiovert 200
fluorescence microscope. Emission ratiometric images (F340 nm:F380 nm) were
collected using MetaFluor Fluorescence Ratio Imaging Software (Molecular Devices).
Experiments were carried out at room temperature (22–24 °C). For further
analysis, about 25 to 30 cells per image were selected manually based on the increases in
[Ca²⁺]i (from highest to lowest). For Ca²⁺ treatment, Fura-2-loaded HEK293 cells
were incubated in a standard buffer containing 130 mM NaCl, 3 mM KCl, 0.6 mM
MgCl₂, 10 mM glucose, 10 mM HEPES, pH 7.4 (adjusted with NaOH), and 0.1 mM
Ca²⁺ for 30 min. The bath was perfused using a peristaltic pump (Dynamax RP-1,
Rainin) with a 2.5 mM Ca²⁺ solution prepared by adding Ca²⁺ to the standard
buffer, and Fura-2 ratiometric images were collected. For the hyperosmotic
treatment, solutions with different osmolality were prepared by adding sorbitol to a
Na⁺-free buffer containing 130 mM NMDG-Cl, 3 mM KCl, 2 mM CaCl₂, 0.6 mM MgCl₂,
10 mM glucose, 10 mM HEPES, pH 7.4 (adjusted with HCl). Unless otherwise
described, the Ca²⁺ concentration of all solutions was held constant at 2 mM.
Osmolality was measured with a vapour pressure osmometer (Vapro 5520, Wescor). The
bath was perfused with hypertonic solutions, and Fura-2 ratiometric images were
collected and analysed.

Mn²⁺ quenching of Fura-2 fluorescence. HEK293 cells were transfected and loaded
with Fura-2AM as described above. Emission (510 nm) images with three
excitation wavelengths (340 nm, 358 nm and 380 nm) were recorded. Quenching of the
358 nm signal, which is the calcium-independent wavelength of Fura-2 and reflects
Mn²⁺ influx across the plasma membrane*-*, was monitored subsequently in the
presence of 1 mM Mn²⁺. Background fluorescence was determined by supplementing
the standard buffer with 10 µM Triton X-100 and 10 mM Mn²⁺. The pcDNA3.2
empty vector-transfected cells were used as a negative control.

HEK293 cell electrophysiology. HEK293 cells were co-transfected with eGFP and
OSCA1, mOSCA1 or pcDNA3.2 at a ratio of 1:20, plated on poly-L-lysine-coated
glass coverslips, and then recorded for electrophysiological signals**”. Patch-clamp
recordings were performed on eGFP-positive cells 24–36 h after transfection.
Gigaohm seals were obtained with pipettes (Kimax 51) having a resistance of 3–5 MΩ in
a standard pipette solution (see below). Liquid junction potentials were measured
and also calculated using pClamp 8.3 software (Molecular Devices), and correction
for this offset was made as described in the software. Voltage-clamp experiments'***”
were performed with an Axopatch 200B patch-clamp amplifier (Molecular Devices),
and data were acquired using a Digidata 1322A interface and the pClamp software.
The currents were recorded at a holding potential of −56 mV at room temperature
and no leak subtraction was performed. Permeability ratios for monovalent cations
to Cs⁺ (PX/PCs) were calculated as follows: PX/PCs = exp(ΔVrevF/RT), where Vrev
is the reversal potential, F is Faraday's constant, R is the universal gas constant,
and T is the absolute temperature*****’. Divalent permeability was calculated as
PY/PCs = [Cs⁺]i exp(ΔVrevF/RT)(1 + exp(ΔVrevF/RT))/4[Y²⁺]o, where the
bracketed terms are ionic activities. Assumed ion activity coefficients were 0.75 for
monovalents and 0.25 for divalents”’.
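
The two permeability formulas above can be evaluated as in the following sketch. It is an illustration only: the reversal-potential shifts are hypothetical, ΔVrev is assumed to be the reversal potential in the test cation minus that in Cs⁺, room temperature is taken as 23 °C, and the activities are computed from the stated solution concentrations (140 mM CsCl in the pipette, 112 mM divalent chloride in the bath) and the stated activity coefficients. The function names are not from the paper.

```python
import math

FARADAY = 96485.33212      # C/mol
GAS_CONSTANT = 8.314462618  # J/(mol K)
ROOM_TEMP_K = 296.15        # assumed 23 degrees C (text gives 22-24 degrees C)


def _exp_term(delta_vrev_mV, temp_K):
    """exp(dVrev * F / (R * T)) with the shift given in millivolts."""
    return math.exp(delta_vrev_mV * 1e-3 * FARADAY / (GAS_CONSTANT * temp_K))


def monovalent_ratio(delta_vrev_mV, temp_K=ROOM_TEMP_K):
    """PX/PCs = exp(dVrev F / RT)."""
    return _exp_term(delta_vrev_mV, temp_K)


def divalent_ratio(delta_vrev_mV, cs_in_activity, y_out_activity,
                   temp_K=ROOM_TEMP_K):
    """PY/PCs = [Cs+]i e (1 + e) / (4 [Y2+]o), with e = exp(dVrev F / RT)."""
    e = _exp_term(delta_vrev_mV, temp_K)
    return cs_in_activity * e * (1.0 + e) / (4.0 * y_out_activity)


# Activities (mM) from the stated concentrations and activity coefficients.
cs_in = 140.0 * 0.75   # pipette CsCl
ca_out = 112.0 * 0.25  # bath CaCl2

# Hypothetical reversal-potential shifts, for illustration only.
print(f"PK/PCs  at dVrev = +10 mV: {monovalent_ratio(10.0):.2f}")
print(f"PCa/PCs at dVrev =  +5 mV: {divalent_ratio(5.0, cs_in, ca_out):.2f}")
```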

Solutions for electrophysiology. The standard pipette solution for all experiments
contained 140 mM CsCl, 5 mM EGTA, 10 mM HEPES, pH 7.4 (adjusted with CsOH)
as described**“**. The standard bath solution contained 140 mM NaCl, 5 mM KCl,
2 mM MgCl₂, 2 mM CaCl₂, 10 mM HEPES, 10 mM glucose, pH 7.4 (adjusted with
NaOH). For monovalent-cation substitution experiments, the bath solution was
changed to 140 mM NaCl (or KCl or CsCl), 10 mM glucose and 10 mM HEPES
(adjusted to pH 7.4 with NaOH, KOH or CsOH, respectively). For divalent-cation
substitution experiments, the bath solution was changed to 112 mM CaCl₂ (or MgCl₂),
10 mM glucose, 10 mM HEPES, pH 7.4 (adjusted with Ca(OH)₂ or Mg(OH)₂,
respectively). Reversal potential was determined using voltage ramps (+100 to −100 mV
in 1.56 s) and current clamps at 0 pA.

PCR primers and vectors. Genotyping primers: OSCA1-LP,
5′-TAACCATTCAGTTGGGTTTCG; OSCA1-RP, 5′-ATTGGACAAACAACGAGTTGG; T-DNA-LB,
5′-TCTGAATTTCATAACCAATCTCG. RT-PCR primers: OSCA1_Fw,
5′-TGCTTGCTTGGGCAGTTCTTGTA; OSCA1_Rev, 5′-GGCAAGAAACTGAAGCCTCATGT;
UBQ_Fw, 5′-TAAAAACTTTCTCTCAATTCTCTCT; UBQ_Rev,
5′-TTGTCGATGGTGTCGGAGCTT. Cloning primers: OSCA1-cDNA_Fw,
5′-CACCATGGCAACACTTAAAGACATT; OSCA1-cDNA_Rev,
5′-(CTA)GACTTCTTTACCGTTAATAAC; OSCA1-eGFP_Fw,
5′-AAACTCGAGATGGCAACACTTAAAGACATTG; OSCA1-eGFP_Rev,
5′-AAACCGCGGGACTTCTTTACCGTTAATAACGG. Primers for OSCA1 promoter: OSCA1P_Fw,
5′-CACCAGTCCCGCGATATTCAGC; OSCA1P_Rev, 5′-GCTTTGTTACTTTTGCTACTCCA.
Vectors: pCMV::OSCA1:pcDNA3.2, pCMV::OSCA1-GFP:pEGFP-N1,
p35S::OSCA1:pMDC32, pOSCA1::GUS:pMDC163, p35S::OSCA1-GFP:pMDC83, and
pOSCA1::OSCA1-GFP:pMDC107. For the mutant OSCA1 (mOSCA1) construct, the
same two mutations (G59R and G507D) as in the osca1 mutant were introduced into
wild-type OSCA1, and the vector for mOSCA1 was pCMV::OSCA1(G59R/G507D):
pcDNA3.2.

Gene accession numbers in GenBank. AtOSCA1.1, KJ920356; AtOSCA1.2, KJ920357;
AtOSCA1.3, KJ920358; AtOSCA1.4, KJ920359; AtOSCA1.5, KJ920360; AtOSCA1.6,
KJ920361; AtOSCA1.7, KJ920362; AtOSCA1.8, KJ920363; AtOSCA2.1, KJ920364;
AtOSCA2.2, KJ920365; AtOSCA2.3, KJ920366; AtOSCA2.4, KJ920367; AtOSCA2.5,
KJ920368; AtOSCA3.1, KJ920369; AtOSCA4.1, KJ920370; OsOSCA1.1, KJ920371;
OsOSCA1.2, KJ920372; OsOSCA1.3, KJ920373; OsOSCA1.4, KJ920374; OsOSCA2.1,
KJ920375; OsOSCA2.2, KJ920376; OsOSCA2.3, KJ920377; OsOSCA2.4, KJ920378;
OsOSCA2.5, KJ920379; OsOSCA3.1, KJ920380; OsOSCA4.1, KJ920381.

Statistical analysis. Independent experiments were performed at least three times.
The statistical analysis was performed using EXCEL 10 software (Microsoft). Data
are presented as mean ± s.d. or s.e.m. To analyse the difference between genotypes,
two-way analysis of variance (ANOVA) was carried out using SAS 9.3 software (SAS
Institute). For Fig. 2c, i and Extended Data Fig. 2g, the boxes represent s.e., the error
bars represent s.d., and the means are within the boxes. P values < 0.05 were
considered statistically significant.

Extended Data Figure 1 | Events occurring after osmotic stress treatment,
and optimized conditions for genetic screens for mutants with low
hyperosmolality-induced [Ca²⁺]i increases. a, Schematic illustration of
events occurring after osmotic stress treatment. It is known that osmotic stress
triggers a signalling cascade, in which the earliest detectable event is an increase
in [Ca²⁺]i that lasts ~5 min (blue)****°. For immediate responses, the signal
is funnelled to downstream events, such as the activation of ASK1 protein
kinase, ABA accumulation and stomatal closure, leading to the reduction of
water loss*. For sustainable responses, the expression profiles for many genes,
such as DREB2A and RD29A, are altered*’. Collectively, although these events
might start as early as the [Ca²⁺]i increase, they display a dynamic change
(phase 1), and take a long time to reach a relative steady state (phase 2;
Supplementary Information). Clearly, in contrast to traditional genetic screens,
in which the phenotypes scored can take hours or days to reach a steady state”*,
the entire transient OICI event lasts only ~5 min, which could be used to
genetically dissect osmosensing. Recently, similar screens using pathogen
elicitors and ATP have been carried out, but the associated Ca²⁺ channels
have not been identified’*’*. The amplitudes of coloured polygons depict the
dynamic activities of these events evoked by osmotic stress. b, Optimized
conditions for genetic screens for mutants with low hyperosmolality-induced
[Ca²⁺]i increases.
M2 seeds were used to determine the optimum genetic screening conditions.
Individual seeds were planted evenly using a template in 150 mm × 15 mm
Petri dishes, and grown for 9 days. The sorbitol solution at an indicated
concentration was added into the Petri dish, and the aequorin images were
acquired. Sorbitol concentrations from 0 to 800 mM were tested and
representative aequorin images are shown (bottom). Relative [Ca²⁺]i in leaves
is scaled by a pseudo-colour bar. Corresponding relative [Ca²⁺]i for each
individual seedling was analysed and plotted (top). At 600 mM sorbitol
concentration, about 95% of seedlings showed an OICI using an artificial
cut-off value (red line), which could be practically used to phenotype/score
seedlings. Similar results were seen in more than 10 independent experiments
and one representative experiment is shown. c, Isolation of individuals with low
OICI in leaves in the first-round screen. The bright-field image was used to
identify the position for each seedling (left). Individual seedlings with lower leaf
OICI signals in the bioluminescence image (right) were circled via image
analyses, and selected seedlings were transferred from the Petri dish to soil.
In the first round we picked seedlings with low leaf OICI signals as putative
osca1 candidates.

Extended Data Figure 2 | Defect in hyperosmolality-induced [Ca²⁺]i
increases in osca1. a, b, Similar total amount of aequorin in wild-type (WT)
and osca1 seedlings. The same seedlings used in Fig. 1a were treated with a
solution containing 0.9 M CaCl₂ and 10% (v/v) ethanol to measure the total
amount of aequorin, and no difference between wild type and osca1 was
observed (a). Similar results were seen in >20 separate experiments.
Quantification of total amount of aequorin in wild-type and osca1 plants from
experiments as in a is plotted as mean ± s.e.m. (b; n = 6; P > 0.8). c, The osca1
mutant shows reduced OICIs. The aequorin-expressing wild-type and osca1
seedlings grown side-by-side were treated with water or 440 mOsm solutions
containing sorbitol, mannitol, sucrose, ribose or N-methyl-D-glucamine
(NMDG), and changes in [Ca²⁺]i in leaves were recorded. Data are
mean ± s.e.m. (n = 33 for sorbitol, 29 for mannitol and sucrose, 26 for ribose
and 21 for NMDG). The responses to these compounds were significantly
reduced in osca1 compared to those in wild type (P < 0.005). d, Averaged
increases in [Ca²⁺]i in wild-type and osca1 roots plotted as a function of applied
sorbitol concentrations. Seedlings were grown in a Petri dish that was placed
vertically similar to those in Fig. 1a, and aequorin images were acquired and
analysed as in Fig. 1. Data for three separate experiments representing 30
seedlings are shown (mean ± s.d.; two-way ANOVA, P < 0.01). e–g, Reduced
OICIs in root cells in osca1. FRET imaging of OICIs was carried out in root cells
in wild-type and osca1 plants expressing the Ca²⁺ indicator protein YC3.6.
Emission images (F535 and F485) of roots were taken every 3 s, and ratiometric
images before and 20 s after addition of 600 mM sorbitol are shown (e). The
F535:F485 ratio is scaled by a pseudo-colour bar. The relative [Ca²⁺]i
(F535:F485) in response to sorbitol treatment was quantified from these root
cells in e (f; mean ± s.e.m.; n = 10). Peak changes in ratios from experiments
similar to e and f are shown (g; boxes represent s.e., error bars are s.d.; n = 26
seedlings; P < 0.001).
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Extended Data Figure 3 | OSCA1 acts upstream of ABA signalling in 
stomatal closure and root growth. a, Comparison of ABA-induced stomatal
closing in wild type and osca1. Data shown are mean ± s.e.m. (n = 60; two-way
ANOVA, P > 0.5). Stomatal apertures were normalized with respect to the
width in the absence of ABA. b, OSCA1 controls transpirational water loss in
response to desiccation treatment. Rosette leaves from wild-type and osca1
seedlings were detached, and transpirational water loss was analysed at the
indicated time points after leaf detachment. Water loss was calculated as a
percentage of the initial fresh weight. Data shown are mean ± s.e.m. (n = 25
leaves; two-way ANOVA, P < 0.001). c, Wild-type and osca1 plants were
grown in ½ MS media containing 0-1 μM ABA, and root lengths were
analysed as in Fig. 2k, l. Data from three independent experiments are
shown (mean ± s.d.; n = 30 seedlings; two-way ANOVA, P > 0.2).
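
The two normalizations described in this legend (water loss as a percentage of initial fresh weight, and stomatal aperture relative to the width without ABA) can be sketched as below. This is a minimal illustration only; the function names and the example weights are hypothetical and are not taken from the paper.

```python
def water_loss_percent(initial_fresh_weight_mg, weight_mg):
    """Cumulative water loss as a percentage of the initial fresh weight."""
    if initial_fresh_weight_mg <= 0:
        raise ValueError("initial fresh weight must be positive")
    return 100.0 * (initial_fresh_weight_mg - weight_mg) / initial_fresh_weight_mg


def normalized_aperture(width_with_aba, width_without_aba):
    """Stomatal aperture expressed relative to the width in the absence of ABA."""
    return width_with_aba / width_without_aba


# Hypothetical detached leaf weighed at detachment and at two later time points (mg).
weights_mg = [200.0, 180.0, 150.0]
loss_curve = [water_loss_percent(weights_mg[0], w) for w in weights_mg]
print(loss_curve)
```

Expressing each leaf relative to its own starting weight removes differences in leaf size before the genotypes are compared.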
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a 
Crosses          F2 plants (WT)    F2 plants (osca1)    Ratio (WT : osca1)
osca1 × ColAQ    101.5 ± 4.5       35.5 ± 2.6           2.92 ± 0.27
osca1 × Ws       78.3 ± 1.7        27.8 ± 2.2           2.93 ± 0.24
b
Chromosome 4
Markers, in the order shown: JV30/31, OS69, OSCA1, OS79, MN4.2, NGA8, ciw6
Genetic distances, in the order shown: 9.4 cM, 4.2 cM, 4.1 cM, 4.7 cM, 8.3 cM, 16.7 cM
Fine-mapping markers, in the order shown: OS100, OS122, OS125, OSCA1, OS114, OS89
Genetic distances, in the order shown: 2.08 cM (26/628), 0.24 cM (3/628), 0.08 cM (1/628), 0.16 cM (2/628), 1.04 cM (13/628)
c
Marker   Physical position (bp)   Primers (forward, reverse)                                  Type    Restriction enzyme
OS69     1112860                  ATTCGGTATCAATGCCTTTC, GATGGAGAAGTTTTACTCTTTTGA              SSLP
OS100    1344258                  CCACGGCACCTGCTAAA, TATACCTGATACTCTTGTTACCCTC                CAPS    ApoI
OS122    1977783                  ATTTACAAACCCGAGCCT, TATCTATCACATCCATTACCGT                  CAPS    BspCNI
OS125    2098811                  CTTTTAATTGTGTATATATAGGATCTTTC, TTATCTTAGACATGCACAATAAAACT   dCAPS   DpnI
OS114    2189910                  CTACTATGGATGAGGACGACA, GTTGAAGGCGGGATTGA                    CAPS    XmnI
OS89     2378036                  TCTCTTCAAACCTCGATCTAAGT, TCTCTTTGTTCATCGTGTGAAT             CAPS    HinfI
OS79     3006988                  AACTAATCCAAGTCCAGCACGA, GATAGGGAATGGCATAAGATAAGC            CAPS    MboII
d 


1 MATLKDIGVSAGINILTAFIFFIIFAFLRLOPFNDRVYFSKWYLRGLRSS PASGGGFAGRFVNLELRSY LKFLHWMPEALKMPERELI DHAGLDSVVYLR 


101 IYWLGLKIFAPIAMLAWAVLVPVNWINNELELAKHFKNVTSSDIDKLTISNIPEGSNREWAHIIMAYAFTIWICYMLMKE YETVANMRLQFLASEGRRPD 


201 OFTVLVRNVPPDPDETVSELVEHFFLVNHPDNYLTHOVVCNANKLADLVSKKTKLONWLDY YOLKYTRNNSQIRPITKLGCLGLCGOKVDAIEHYIAEVD 


301 KTSKEIAEERENVVNDOKSVMPASFVSFKTRWAAAVCAOTTOTRNPTEWLTEWAAE PRDI YWPNLAI PYVSLTVRRLVMNVAFFFLTFFFIIPIAFVOSL 
TM4 


401 ATIEGIEKVAPFLKVIIEKDFIKSLIQGLLAGIALKLFLIFLPAILMIMSKFEGFTSVSFLERRSASRYYIFNLVNVFLGSVIAGAAFEQLNSFLNOSPN 


501 QIPKTIGMAI PMKATFFITYIMVDGWAGVAGEILMLKPLI I YHLKNAFLVKTEKDREEAMNPGS1IGFNTGEPQIQLYFLLGLVYAPVTPMLLPFILVFFA 


G507D 


601 LAYVVYRHQI INVYNQEYESAAAFWPDVHGRVITALIISOLLLMGLLGTKHAASAAPFLIALPVITIGFHRFCKGRFEPAFVRY PLQEAMMKDTLERARE. 


701 PNLNLKGYLQDAY IHKPVFKGGDNDDDGDMIGKLENEVIIVPTKRQSRRNTPAPSRISGESSPSLAVINGKEV 
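
As a rough cross-check of panels a and b, the sketch below recomputes two quantities: a chi-square statistic for a 3:1 segregation, and map distances from recombinant counts. It rests on assumptions that the figure does not state: that each count in panel b is the number of recombinant chromosomes among 2 × 628 = 1,256 scored, that distance in cM is simply the recombination percentage, and that the panel a means read as about 101.5 wild-type and 35.5 osca1 seedlings. The function names are illustrative.

```python
def map_distance_cm(recombinants, plants):
    """Recombination percentage, counting two chromosomes per F2 plant."""
    return 100.0 * recombinants / (2 * plants)


def chi_square_3_to_1(wild_type, mutant):
    """Chi-square statistic (1 degree of freedom) against a 3:1 expectation."""
    total = wild_type + mutant
    expected_wt = 0.75 * total
    expected_mut = 0.25 * total
    return ((wild_type - expected_wt) ** 2 / expected_wt
            + (mutant - expected_mut) ** 2 / expected_mut)


# Recombinant counts as read from panel b (assumed to be per 628 F2 plants).
recombinants = {"OS100": 26, "OS122": 3, "OS125": 1, "OS114": 2, "OS89": 13}
distances_cm = {marker: round(map_distance_cm(count, 628), 2)
                for marker, count in recombinants.items()}

# Mean seedling counts as read from panel a for the osca1 x ColAQ cross.
chi2 = chi_square_3_to_1(101.5, 35.5)

print(distances_cm)
print(chi2)
```

Under these assumptions the computed distances agree with the panel b values to within rounding, and the chi-square statistic is far below 3.84, the 5% critical value for one degree of freedom, which is consistent with the 3:1 interpretation given in the legend.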


Extended Data Figure 4 | Genetic analysis and map-based cloning of
OSCA1. a, All F1 seedlings derived from osca1 × wild-type (WT; ColAQ,
Col-0 expressing aequorin) crosses showed wild-type OICI signals. F2 seedlings
showed a 3:1 wild-type:osca1 segregation, suggesting that the osca1 phenotype
resulted from a recessive mutation in a single nuclear gene. Note that it was
not feasible to phenotype the F1 and F2 populations from crosses between
osca1 × Landsberg erecta (Ler). The F2 seedlings, which were derived from
osca1 × Wassilewskija (Ws) crosses and also identified as aequorin
homozygous, showed a 3:1 wild-type:osca1 segregation. The same number
of F2 seeds for each cross was placed in Petri dishes and OICI phenotypes
were scored for individual seedlings (mean ± s.e.m.; n = 4 and 6 for
osca1 × ColAQ and osca1 × Ws crosses, respectively). b, Physical mapping
of OSCA1. OSCA1 was positioned between the JV30/31 and MN4.2 markers in the
short arm of chromosome 4, close to the centromere, in a segregating F2 population
derived from the osca1 × Ws cross. OSCA1 was fine-mapped to a region
between OS114 and OS125 by analysing 1,256 recombinant chromosomes in
the F2 population with the molecular markers listed in c. We sequenced all open
reading frames (ORFs) in the region between these two markers and identified
two mutations in an ORF, which corresponded to the gene At4g04340.
c, Molecular markers developed for fine mapping. At the time when we were
fine-mapping osca1, the whole-genome sequence for Ws was not available.
Thus, we used the 250,000 single-nucleotide polymorphism (SNP) data to
develop these markers. d, OSCA1 encodes a protein with transmembrane
α-helices. The transmembrane α-helices (TM; blue), the putative ion channel
pore-forming domain (green), and mutations of glycine 59 to arginine (G59R)
and glycine 507 to aspartic acid (G507D) in red in osca1 are shown.
Extended Data Figure 5 | Hydropathy and transmembrane probability
plots of OSCA1. a, Hydropathy of OSCA1 was calculated using the Kyte-
Doolittle algorithm, with a window size of 19 amino acids. The probabilities of
transmembrane helices (TM1-TM9) were predicted using the TmHMM 2.0
program, and the probability plot (red) is superimposed on the hydropathy
plot (blue). The region between TM8 and TM9 could be another
transmembrane segment (b), or a re-entrant pore loop, a common structure
in ion channels. Based on the probability prediction, it is most likely to
be a re-entrant pore loop, which needs to be verified in the future.
b, Transmembrane α-helical spanners predicted by Aramemnon
(http://aramemnon.botanik.uni-koeln.de).
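
The hydropathy calculation named in panel a can be illustrated with a short sliding-window sketch. It uses the published Kyte-Doolittle scale and the 19-residue window stated in the legend; the function name and the toy sequence are hypothetical, and this is not the authors' analysis pipeline. With a 19-residue window, window means above roughly 1.6 are conventionally read as candidate membrane-spanning segments.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=19):
    """Return (centre position, mean hydropathy) for every full window.

    Positions are 1-based and refer to the central residue of each window.
    """
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence.upper()]
    half = window // 2
    profile = []
    for start in range(len(scores) - window + 1):
        mean_score = sum(scores[start:start + window]) / window
        profile.append((start + half + 1, mean_score))
    return profile


# Toy sequence (hypothetical): a 19-residue leucine stretch flanked by aspartates.
toy_sequence = "D" * 10 + "L" * 19 + "D" * 10
profile = hydropathy_profile(toy_sequence)
peak_position, peak_score = max(profile, key=lambda item: item[1])
print(peak_position, round(peak_score, 2))
```

The same function could be applied to the OSCA1 sequence in Extended Data Fig. 4d once that sequence has been checked against a database entry, since the extracted text may contain character-recognition errors.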
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Extended Data Figure 6 | Verification of the T-DNA insertion osca1-2 
mutant and expression patterns and subcellular locations of OSCA1.
a-c, Schematic illustration of the exon-intron structure of OSCA1 with the
boxes representing exons (a). The mutations in osca1-1 and T-DNA insertion
sites in osca1-2 and osca1-3 are illustrated. Primers for genotyping the T-DNA
insertion in osca1-2 are shown. BP, T-DNA border primer; LP, OSCA1 left
primer; RP, OSCA1 right primer. osca1 refers to osca1-1 in this study.
Genotyping of an osca1-2 homozygous line (b). PCR reactions with DNA show
a flanking DNA fragment upstream (LP) and downstream of the insertion site
(RP) in wild type (WT) but not in osca1-2, and a DNA fragment flanking
the T-DNA border (BP) and the region downstream of the insertion site (RP) in
osca1-2 but not wild type, suggesting that osca1-2 is a homozygous T-DNA
insertion line. The OSCA1 mRNA level was greatly reduced in osca1-2, but
the expression of OSCA1 was not abolished (c), suggesting that osca1-2 is a
knock-down mutant rather than a null mutant. d-g, Expression patterns of
pOSCA1::GUS in Arabidopsis leaf (d), flower bud (e), flower (f) and silique (g).
The intensity of blue represents the level of GUS activity. h-k, Expression
patterns of OSCA1-GFP in Arabidopsis seedlings stably expressing the OSCA1
promoter-driven OSCA1-GFP construct (pOSCA1::OSCA1-GFP) (h) or the
CaMV 35S promoter-driven GFP construct (p35S::GFP) (i). GFP fluorescence
was analysed using a Zeiss stereo microscope, and images were merged to
generate the whole-seedling images. Insets are enlargements of root tips. Over
10 homozygous single-insertion transgenic lines were generated for each
construct, and similar results were observed from these lines. Plasma
membrane localization of OSCA1 in Arabidopsis seedlings stably expressing the
CaMV 35S promoter-driven OSCA1-GFP construct (p35S::OSCA1-GFP) (j) or
GFP alone as a control (p35S::GFP) (k). GFP fluorescence was analysed using
confocal microscopy. Similar results were seen from over 10 independent
homozygous single-insertion transgenic lines. In addition, OSCA1 is also
predicted to be localized to the plasma membrane by SUBA3
(http://suba.plantenergy.uwa.edu.au/). Moreover, OSCA1 has been identified
independently by several studies of plasma membrane proteomes.

Extended Data Figure 7 | OSCA1 confers calcium-induced [Ca²⁺]ᵢ
increases (CICI) in HEK293 cells. a, The increases in [Ca²⁺]ᵢ in response
to elevated Ca²⁺ in HEK293 cells expressing empty vector (pcDNA3.2; top),
OSCA1 (middle), or mutant OSCA1 (OSCA1(G59R/G507D), mOSCA1;
bottom). HEK293 cells transiently transfected with empty vector pcDNA3.2,
OSCA1, or mOSCA1 were incubated in 0.1 mM Ca²⁺ bath solution, and then
treated with 2.5 mM Ca²⁺. The [Ca²⁺]ᵢ increase was analysed by Fura-2
emission ratios (F340 nm:F380 nm) and scaled using a pseudo-colour bar.
b-d, Dynamic analysis of CICI in HEK293 cells expressing empty vector (b),
OSCA1 (c) or mOSCA1 (d) from experiments as in a. Data are mean ± s.d.
(n = 60 cells; r.u., relative unit). Arrows indicate the time of Ca²⁺ addition.
e, Quantitative analysis of the peaks of CICI from 80 to 90 s after addition of
Ca²⁺ from experiments as in b-d. We have also carried out experiments with
a range of concentrations of Ca²⁺, and calculated the Kd as 3.6 ± 0.25 mM.
Data for three separate experiments are shown (mean ± s.e.m.). f, g, The
[Ca²⁺]ᵢ increases in response to osmotic stress treatment in HEK293 cells
expressing pcDNA3.2, which were used as a control for HEK293 cells
expressing OSCA1 or mOSCA1 as shown in Fig. 4a. The cells were incubated in
the standard bath solution, and then treated with 650 mM sorbitol. The [Ca²⁺]ᵢ
increases were analysed by Fura-2 emission ratios (f). OICIs in HEK293
cells expressing empty vector from experiments as in f were quantified
(g; mean ± s.d.; n = 60 cells). Sor, sorbitol.
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Extended Data Figure 8 | OSCA1 is localized to the plasma membrane and 
forms non-selective cation channels with permeability to Ca²⁺ in HEK293
cells. a, b, HEK293 cells were transiently transfected with OSCA1-GFP or GFP
constructs, and GFP fluorescence was analysed using the Zeiss Axiovert 200
fluorescence microscope. OSCA1 was localized in the vicinity of the plasma
membrane (a), while GFP alone was localized throughout the cells (b). These
cells were further analysed by confocal microscopy imaging (Fig. 4e). c, Ca²⁺
influx across the plasma membrane was analysed using Mn²⁺ quenching
of Fura-2 fluorescence in HEK293 cells. HEK293 cells transfected with
pcDNA3.2, OSCA1 or mOSCA1 were loaded with Fura-2 and incubated in the
standard bath solution. The bath was perfused with the same solution
supplemented with 1 mM Mn²⁺, and quenching of Fura-2 fluorescence at 358 nm was
monitored. Percentages show the relative quenching (F, Fura-2 fluorescence
intensity F358; F0, F358 at time zero; mean ± s.e.m.; n = 60 cells). d, Averaged
current-voltage relationships from experiments similar to those in Fig. 4k
(mean ± s.e.m.; n = 6 for control and 9 for OSCA1). Ctrl, control. e, Current-
voltage relationship from single-channel recordings in experiments as in Fig. 4l
(mean ± s.e.m.; n = 4). f, Superimposed whole-cell currents recorded during
voltage ramps. OSCA1 currents were first recorded in the standard bath
solution and then in solutions containing (in mM) 140 CsCl, 140 KCl,
140 NaCl, 112 CaCl₂, or 112 MgCl₂. g, Relative ion permeability (PX/PCs)
of OSCA1 channels from experiments similar to those in f. Data are
mean ± s.e.m. (n = 3 to 9).

Panel g values:
X        Erev (mV)       PX/PCs
K⁺       +21.6 ± 4.5     2.3
Ba²⁺     −1.1 ± 1.5      1.8
Ca²⁺     3.2 ± 2.1       1.6
Na⁺      +3.2 ± 1.6      1.14
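
The legend does not state how PX/PCs was derived from the voltage ramps. One common approach for monovalent cations, assumed here purely for illustration, is the bi-ionic form of the Goldman-Hodgkin-Katz equation: when the external cation is swapped at equal concentration, PX/PCs = exp(ΔErev·F/RT), where ΔErev is the shift in reversal potential relative to Cs⁺. The function name and the 22 °C recording temperature below are assumptions, and divalent cations such as Ca²⁺ need a different expression that is not covered.

```python
import math

GAS_CONSTANT = 8.314462618   # J mol^-1 K^-1
FARADAY = 96485.33212        # C mol^-1


def permeability_ratio_monovalent(delta_erev_mv, temperature_c=22.0):
    """P_X / P_Cs from a reversal-potential shift, for an equimolar cation swap.

    Assumes both cations are monovalent and that no other permeant ion
    contributes; the default temperature is an assumption.
    """
    thermal_voltage_mv = 1000.0 * GAS_CONSTANT * (temperature_c + 273.15) / FARADAY
    return math.exp(delta_erev_mv / thermal_voltage_mv)


# Illustrative shifts in reversal potential (mV) relative to Cs+.
ratio_small_shift = permeability_ratio_monovalent(3.2)
ratio_large_shift = permeability_ratio_monovalent(21.6)
print(round(ratio_small_shift, 2), round(ratio_large_shift, 2))
```

A shift of a few millivolts therefore corresponds to a ratio close to 1, and a shift of about 20 mV to a ratio a little above 2, which is the sense in which a channel with ratios in this range is described as non-selective.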
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Extended Data Figure 9 | Phylogeny of OSCA1 family and alignment 
of eight members from the first clade of Arabidopsis OSCA1 family.
a, The sequences of OSCA1 homologues from Arabidopsis thaliana were
retrieved from NCBI GenBank. The phylogeny of OSCA1 homologues was
analysed using DNASTAR Lasergene 11 with Clustal Omega (ref. 62), and the
phylogenetic tree was illustrated using FigTree 1.4 (http://tree.bio.ed.ac.uk/
software/figtree/). b, Eight AtOSCA members from the first clade were aligned
using Clustal Omega (ref. 62). The first mutation of G59R and the second
mutation of G507D in the osca1 mutant are shown. The mutation of polar
glycine 59 to basic arginine (G59R) and especially the mutation of highly
conserved glycine 507 to acidic aspartic acid (G507D) might both markedly
alter the conformation of OSCA1, leading to changes in OSCA1 activity.
Red, small (small and hydrophobic) amino acids; blue, acidic amino acids;
magenta, basic amino acids excluding H, R and K; green, hydroxyl, sulphydryl
and amine amino acids (amine amino acids are those with a functional group
that contains a basic nitrogen atom, such as asparagine and glutamine). The
colour code is identical to the code defined in ref. 62.
[Panel b: Clustal Omega alignment of the eight clade-1 AtOSCA protein sequences, with the G59R and G507D positions marked.]
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Extended Data Figure 10 | Phylogeny of OSCA1 family across the taxa. The 
sequences of OSCA1 homologues from several species across the taxa were
retrieved from NCBI GenBank. The phylogeny of these homologues was
analysed using DNASTAR Lasergene 11 with Clustal Omega (ref. 62), and the
phylogenetic tree was illustrated using FigTree 1.4. Four clades were classified
based on the phylogenetic tree, with clades 3 and 4 uniquely having 1 or 2
genes for vascular plants. At, Arabidopsis thaliana; Bradi, Brachypodium
distachyon; Cr, Chlamydomonas reinhardtii; Hs, Homo sapiens; Mm, Mus
musculus; Os, Oryza sativa; Pp, Physcomitrella patens; Pt, Populus trichocarpa;
Sb, Sorghum bicolor; Sc, Saccharomyces cerevisiae; and Sm, Selaginella
moellendorffii. OsOSCA1.1, Os01g0534900; OsOSCA1.2, Os05g0594700;
OsOSCA1.3, Os05g0393800; OsOSCA1.4, Os10g0579100; OsOSCA2.1,
Os12g0633600; OsOSCA2.2, Os03g0673800; OsOSCA2.3, Os03g0726300;
OsOSCA2.4, Os12g0582800; OsOSCA2.5, Os01g0950900; OsOSCA3.1,
Os07g0150100; OsOSCA4.1, Os03g0137400.
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Antiviral immunity via RIG-I-mediated recognition 
of RNA bearing 5′-diphosphates
Christine Schuberth, Annemarthe G. Van der Veen, Tsutomu Fujimura, Jan Rehwinkel, Jason A. Iskarpatyoti,
Winfried Barchet, Janos Ludwig, Terence S. Dermody, Gunther Hartmann & Caetano Reis e Sousa


Mammalian cells possess mechanisms to detect and defend themselves
from invading viruses. In the cytosol, the RIG-I-like receptors (RLRs),
RIG-I (retinoic acid-inducible gene I; encoded by DDX58) and MDA5
(melanoma differentiation-associated gene 5; encoded by IFIH1) sense
atypical RNAs associated with virus infection. Detection triggers a
signalling cascade via the adaptor MAVS that culminates in the
production of type I interferons (IFN-α and IFN-β; hereafter IFN), which are
key antiviral cytokines. RIG-I and MDA5 are activated by distinct viral
RNA structures and much evidence indicates that RIG-I responds
to RNAs bearing a triphosphate (ppp) moiety in conjunction with a
blunt-ended, base-paired region at the 5′-end (reviewed in refs 1-3).
Here we show that RIG-I also mediates antiviral responses to RNAs
bearing 5′-diphosphates (5′pp). Genomes from mammalian reoviruses
with 5′pp termini, 5′pp-RNA isolated from yeast L-A virus, and
base-paired 5′pp-RNAs made by in vitro transcription or chemical
synthesis, all bind to RIG-I and serve as RIG-I agonists. Furthermore,
a RIG-I-dependent response to 5′pp-RNA is essential for controlling
reovirus infection in cultured cells and in mice. Thus, the minimal
determinant for RIG-I recognition is a base-paired RNA with 5′pp.
Such RNAs are found in some viruses but not in uninfected cells,
indicating that recognition of 5′pp-RNA, like that of 5′ppp-RNA,
acts as a powerful means of self/non-self discrimination by the innate
immune system.

RIG-I contributes to IFN production by cells infected with reovirus or
transfected with the double-stranded (ds)RNA segments of the reovirus
genome. Short stretches of dsRNA have been thought to be responsible,
but this is hard to reconcile with the fact that RIG-I activation depends
on its carboxy-terminal domain (CTD), which caps RNA ends rather
than folding over stems. The CTD contains a pocket that accommodates
5′ppp-RNA, allowing for extensive interactions with the α- and
β-phosphates but, interestingly, less so with the γ-phosphate.
Furthermore, an earlier RIG-I CTD structure showed a complex with a 5′
di- rather than tri-phosphate RNA, possibly as a result of 5′ppp-RNA
hydrolysis during crystallization. Notably, all 10 reovirus genome
segments display a free 5′pp on the negative strand as a result of
triphosphate processing by a viral phosphohydrolase (also see Supplementary
Fig. 1 and Extended Text for Supplementary Fig. 1). We therefore
hypothesized that a 5′pp blunt-ended, base-paired RNA such as that found in
reovirus genomic RNA can bind RIG-I and serve as a physiological agonist
for antiviral immunity.

First, we assessed the 5′-phosphate-dependence of stimulation by
reovirus RNA. RNA extracted from cells infected with reovirus strain type 3
Dearing (reoT3D) or isolated from reoT3D virus particles (viral RNA,
vRNA) induced the expression of an IFN-β reporter gene following
transfection into HEK293 cells (Fig. 1a, b). Calf intestinal phosphatase
(CIP; Fig. 1a, b) treatment substantially reduced the stimulatory activity of
reovirus vRNA, as it did that of RNA from influenza A virus (IAV)-infected
cells, a known 5′ppp-dependent RIG-I stimulus. Similar results were
obtained using reovirus strain type 1 Lang (reoT1L) and a distinct
5′-polyphosphatase (Extended Data Fig. 1a and Supplementary Fig. 1i). The
response to total reovirus vRNA could be recapitulated using purified
large (L), medium (M), and small (S) genome segments (Fig. 1c, d and
Extended Data Fig. 1b) but not short (<20 residues, labelled <S; Fig. 1c)
single-stranded (ss)RNA oligonucleotides encapsidated within purified
virions.

The role of RIG-I versus MDA5 in responses to reovirus is unclear.
HEK293 cells used for reporter assays respond strongly to RIG-I agonists
but poorly to triggers of MDA5 (data not shown). To dissect pathways
involved in 5′pp-RNA recognition, we therefore switched to mouse cells
(dendritic cells (DCs) or mouse embryonic fibroblasts (MEFs)) that
display sensitivity to agonists of either RLR (Fig. 1e-h). RIG-I- or
MDA5-deficient cells showed the expected loss of IFN response to selective RIG-I
or MDA5 agonists (99-nucleotide ppp-IVT-RNA or Vero-EMCV-RNA,
respectively) but retained the capacity to respond to reovirus vRNA
(Fig. 1f, g, Extended Data Fig. 1d and data not shown). Abrogation of
the response to reovirus vRNA was only observed in MAVS-deficient
or RIG-I/MDA5 doubly-deficient MEFs (Fig. 1e, h). However,
compensation in RIG-I-sufficient but MDA5-deficient (MDA5−/−) cells was
lost upon vRNA treatment with phosphatase, even though the same
treatment did not affect RIG-I−/− cells (Fig. 1f, g). Thus, when RIG-I is the
dominant RLR (MDA5−/− mouse cells or HEK293 human cells),
responses to reovirus RNA are sensitive to phosphatase treatment. These
data indicate that reovirus genome segments can activate both MDA5
and RIG-I irrespective of their length, but that 5′-diphosphates on the
reovirus genome are required for RIG-I but not MDA5 activation. A role
for 5′-diphosphate moieties in RIG-I activation by viruses was further
confirmed using the L-A totivirus, a dsRNA virus commonly found in
Saccharomyces cerevisiae, that synthesizes transcripts with a 5′pp
terminus and is thought to harbour a genome with capped or diphosphate
5′-ends (Fig. 1i-l and Extended Data Fig. 1c, d). Thus, 5′-diphosphate-
bearing RNAs of two distinct viral origins act as agonists for RIG-I.

To determine whether RIG-I associates with 5′pp-containing viral
RNA in infected cells, nucleic acids were purified from Flag-tagged RIG-I
that was precipitated from cells infected with either reoT1L or reoT3D.
In both cases, we recovered stimulatory RNA from anti-Flag but not from
control immunoprecipitations (Fig. 2a and Extended Data Fig. 2). Similar
results were obtained when recombinant Flag-tagged RIG-I was
incubated with total reovirus vRNA, purified S and L segments or with L-A
virus genomic and transcript RNA (Fig. 2b, c). In all cases, the
stimulatory activity of RNA associated with RIG-I precipitates was lost after
treatment with phosphatase (Fig. 2b, c). Thus, RIG-I can directly bind
viral RNAs bearing 5′-diphosphates.

Previous data suggested that in vitro transcribed (IVT)-RNA with a
5′pp does not serve as a RIG-I agonist. The IVT-RNA in question bore
a single guanosine residue at the 5′ end to permit generation of 5′pp
when GDP instead of GTP was included in the transcription reaction.
However, because transcriptional elongation requires a triphosphate-
bearing nucleoside to form the phosphodiester bond, GDP also prevented
the generation of the polymerase copy-back IVT base-paired by-products
later demonstrated to be required for RIG-I stimulation. We therefore
re-synthesized the short 25-nucleotide 5′-diphosphate transcript
(5′pp-IVT-RNA), but this time annealed it to complementary
(antisense, AS) synthetic RNA to form the requisite base-paired structure
(Fig. 3a). As expected, a control 5′p-IVT-RNA synthesized using
GMP was not stimulatory, regardless of hybridization to AS RNA
(Fig. 3b). In contrast, the positive control 5′ppp-IVT-RNA made with
GTP was stimulatory even without AS annealing (Fig. 3b), as a
consequence of the aforementioned by-products. Most notably,
5′pp-IVT-RNA was also stimulatory but only when annealed to the AS strand
(Fig. 3b). Treatment with phosphatase resulted in a complete loss of
stimulatory activity (Fig. 3c), demonstrating strict dependence on the
5′-diphosphate moieties, and the response was MAVS- and RIG-I-dependent
(Fig. 3d, e), as predicted. Stimulatory activity was preserved in gel-purified
5′pp-IVT-RNA + AS (data not shown) and the purity of all guanosine
batches used to generate the 5′ mono-, di- or triphosphate IVT RNAs
was verified by liquid-chromatography mass spectrometry (Extended
Data Fig. 3a). Further excluding a role for contamination, no stimulatory
RNA was generated even when a 5′p-IVT-RNA reaction was
deliberately spiked with up to 10% GTP (Extended Data Fig. 3b).


To strengthen these observations, we chemically synthesized a
24-nucleotide 5′ppp-RNA and subjected half of the
sample to enzymatic hydrolysis of the γ-phosphate using the 5′-RNA
triphosphatase activity of the vaccinia virus capping enzyme to generate
5′pp-RNA. The purity of both 5′ppp-RNA and 5′pp-RNA
was validated (Extended Data Fig. 3c) and the RNAs were annealed to
AS RNA (+AS) and assessed for IFN-inducing ability. 5′pp-RNA + AS
was clearly stimulatory for both human and mouse cells in a RIG-I-
dependent manner (Fig. 3f, g) and only threefold less active than the
5′ppp-RNA + AS control (Fig. 3f). Binding assays showed that RIG-I
has similar affinity for both RNAs (apparent Kd of 16.7 nM for
5′pp-RNA + AS versus 9.4 nM for 5′ppp-RNA + AS) but binds
5′p-RNA + AS much more weakly (Fig. 3h). The latter RNA also failed
to induce IFN (Fig. 3f), consistent with reports that ligand binding is
necessary but not sufficient for RIG-I activation. Altogether, we conclude
that, similar to natural 5′pp-containing viral RNA, synthetic 5′pp-base-
paired RNAs trigger a RIG-I-dependent response.
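
To give a feel for what "similar affinity" means for the two apparent Kd values quoted above, the sketch below compares them under a simple one-site equilibrium binding model, in which the fraction of RIG-I bound is [RNA] / (Kd + [RNA]) with the RNA in excess. The model, the function name and the example RNA concentrations are assumptions for illustration and are not the fitting procedure used in the binding assays.

```python
def fraction_bound(rna_nm, kd_nm):
    """Fraction of receptor occupied at equilibrium in a one-site model."""
    return rna_nm / (kd_nm + rna_nm)


KD_PP_NM = 16.7    # apparent Kd for 5'pp-RNA + AS, from the text
KD_PPP_NM = 9.4    # apparent Kd for 5'ppp-RNA + AS, from the text

fold_difference = KD_PP_NM / KD_PPP_NM

# Hypothetical free RNA concentrations (nM) at which to compare occupancy.
for rna_nm in (1.0, 10.0, 100.0):
    pp = fraction_bound(rna_nm, KD_PP_NM)
    ppp = fraction_bound(rna_nm, KD_PPP_NM)
    print(f"{rna_nm:6.1f} nM  5'pp: {pp:.2f}  5'ppp: {ppp:.2f}")

print(f"Kd ratio (5'pp / 5'ppp): {fold_difference:.2f}")
```

In this model the two Kd values differ by less than twofold, so the predicted occupancies converge as the RNA concentration rises well above both Kd values.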

Lastly, to assess the physiological importance of our findings, we studied
the innate immune response to reovirus infection in MDA5- or MAVS-
deficient cells and mice. At 48 h post-infection, there was an increase in
reoT3D S4 genome segment copy number in DCs incapable of
responding to MDA5 or RIG-I agonists (MAVS−/−) compared with MDA5-
deficient cells in which the RIG-I pathway is intact (Fig. 4a, b). As reported
for MEFs, the control of viral replication correlated with the respective
capacity of MAVS−/− and MDA5−/− DCs to produce IFN following
infection (Fig. 4a, b). Importantly, there was little difference between WT and
MDA5−/− DCs, indicating that RIG-I alone can serve to control reovirus
infection in these cells. This conclusion was confirmed using MEFs
deficient for either or both RLRs: an increase in viral load was only observed
in RIG-I/MDA5 doubly deficient cells (Fig. 4c). Finally, MDA5−/− and
MAVS−/− mice were perorally infected with reovirus. Reovirus replication
in the intestine and the mesenteric lymph nodes (MLN) was significantly
greater in RIG-I-signalling-incompetent mice (MAVS−/−) compared
with RIG-I-signalling-competent mice (MDA5−/−; Fig. 4d, e). Taken
together, these data demonstrate that RIG-I can control the replication
of reovirus, which bears a 5′-diphosphate dsRNA genome.

The detection of virus infection is crucial for the successful initiation
of innate and adaptive antiviral responses. Virus recognition can be
mediated by RIG-I through the sensing of viral genomes bearing blunt-ended,
base-paired termini with 5′-triphosphates. Here, we show that a
diphosphate at the 5′-end of blunt-ended, base-paired RNA is in fact sufficient
to activate RIG-I. These findings provide physiological meaning to recent
structural data demonstrating that the RIG-I CTD can accommodate
both 5′pp and 5′ppp-RNA. They also reiterate the importance of
end- rather than stem-based recognition of RNA by RIG-I (refs 9, 10, 22),
and thereby raise the possibility that some viral and synthetic 5′ppp-free
RIG-I agonists bear a 5′pp-moiety that contributes to activity. In this
regard, the commonly used IFN-inducing stimulus polyinosinic:polycytidylic
acid (poly(I:C)) is a homopolymer synthesized from inosine
and cytidine diphosphate likely to contain a certain proportion of
5′-ends bearing diphosphates (Extended Data Fig. 4a). We noticed that
the stimulatory capacity of some batches of poly(I:C) can be decreased
by phosphatase treatment, especially when this is coupled with mild
RNase treatment to free up strands (Extended Data Fig. 4b, c). A role for
5′-diphosphates in poly(I:C) and reovirus recognition by RIG-I offers
an alternative explanation for the observation that short poly(I:C) or
short reovirus RNA segments preferentially activate RIG-I versus MDA5.
Rather than RIG-I and MDA5 detecting different dsRNA lengths, perhaps
it is the ratio of 5′pp-ends to stems that dictates which RLR dominates
in recognition. Notably, many viruses display 5′ppp-RNAs in infected
cells as a result of using a primer-independent mechanism for RNA
synthesis. However, viruses such as reoviruses and poxviruses encode
5′-RNA triphosphatases that catalyse hydrolysis of the γ-phosphate in
viral genomes or primary transcripts to generate 5′pp-RNAs. In contrast,
host cytosolic RNAs (messenger RNA, transfer RNA or ribosomal RNA)
are uniformly devoid of both 5′ppp- and 5′pp-structures. Thus, the
capacity of RIG-I to respond to base-paired RNAs with either a 5′ di- or
triphosphate extends the number of viruses that can be detected by a single innate
immune sensor.

Figure 2 | RIG-I associates with 5′-diphosphate-
Figure 4 | RIG-I is required for control of


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Reagents. Poly(dA:dT) (P0833) and ribavirin were purchased from Sigma-Aldrich
and used at 500 ng ml⁻¹ and 400 μM final concentration, respectively. Cyclic
diguanosine monophosphate (cyclic-di-GMP) was purchased from BioLog Life Science
Institute (Bremen, Germany) and used at a concentration of 1 μg ml⁻¹.
Recombinant IFN-A/D was purchased from PBL Assay Science (Piscataway, NJ).

For production of recombinant RIG-I used in Fig. 2, the 3×Flag-human RIG-I
DNA sequence was amplified using forward primer
5′-actcgagttatggactacaaagaccatgacgg-3′ and reverse primer
5′-ttgcggccgctcatttggacatttctgctggatcaa-3′ and cloned
into the pBacPAK-His3-GST plasmid. Recombinant 3×Flag-human RIG-I was
expressed as a GST-tagged protein in Sf9 insect cells using a baculovirus expression
system and purified in a single step by affinity chromatography using a glutathione-
sepharose matrix (GE Healthcare, Little Chalfont, United Kingdom). The protein
was eluted by GST tag cleavage using 3C enzymatic digestion. A final polishing step
was accomplished using a Superdex 200 10/300 GL column (GE Healthcare). Protein
purity was verified by acrylamide gel electrophoresis, and protein yield was
quantified using a Nanodrop apparatus (ThermoScientific, Waltham, MA).
Cells. All cells were mycoplasma negative and cultured using tissue-culture treated 
polystyrene plates (Falcon, Fisher Scientific International Inc., Hampton, NH) in 
an incubator with 5-10% CO₂ at 37 °C. HEK293 cells were provided by F. Weber
(Freiburg, Germany). Vero and L929 cells were obtained from Cancer Research UK
Cell Services. Dulbecco’s Modified Eagle Medium (DMEM) (Gibco, Life Technologies,
Carlsbad, CA) supplemented to contain 10% FCS (Autogen Bioclear UK,
Ltd, Mile Elm, United Kingdom), 100 U ml⁻¹ penicillin, 100 U ml⁻¹ streptomycin,
and 0.3 µg ml⁻¹ glutamine was used as growth medium. HEK293 cells stably
expressing 3×Flag-RIG-I have been described. Granulocyte-macrophage colony-
stimulating factor (GM-CSF) bone marrow-derived DCs were prepared as described.
MDA5−/− and RIG-I−/− and littermate control MEFs were prepared from 12.5-day
embryos using standard protocols. RIG-I/MDA5-deficient MEFs were generated
by CRISPR-Cas9-mediated genome engineering. A target sequence in the first
exon of murine RIG-I (CTACATGAGTTCCTGGCTCGAGG (PAM motif under- 
lined)) was chosen and appropriate oligonucleotides were cloned into the BbsI site 
of pX458 (pSpCas9(BB)-2A-GFP; obtained from the laboratory of Feng Zhang via 
Addgene (Cambridge, MA; plasmid 48138)) according to the cloning protocol pro- 
vided by the Zhang lab (http://www.genome-engineering.org). MDA5-deficient 
MEFs immortalized with simian virus 40 large T antigen were transfected with the
RIG-I targeting pX458 vector using Lipofectamine 2000 (Life Technologies) accord- 
ing to the manufacturer’s instructions. Twenty-four hours post-transfection, GFP- 
positive cells were FACS sorted and cultured at limiting dilution to pick individual 
colonies. The absence of functional RIG-I was verified in several clones by assessing
loss of ifit1 induction by quantitative PCR following transfection of a known
RIG-I agonist (ppp-IVT-RNA). In experiments in which these RIG-I/MDA5-
deficient MEFs were used, the parental immortalized MDA5−/− MEF line was
included as control. 

Human PBMCs were isolated as described from whole human blood of healthy,
voluntary donors by Ficoll-Hypaque density gradient centrifugation (Biochrom,
Berlin, Germany) and cultured in RPMI 1640 supplemented to contain 10% FCS,
1.5 mM L-glutamine, 100 U ml⁻¹ penicillin, and 100 µg ml⁻¹ streptomycin.
Viruses. All infection work was carried out according to the requirements for han- 
dling biological agents in Advisory Committee on Dangerous Pathogens hazard 
groups. Reovirus strains type 1 Lang (T1L) and type 3 Dearing (T3D) were recovered 
by plasmid-based reverse genetics from cloned T1L and T3D cDNAs, respectively.
The reovirus strain T1L used for mass spectrometry analysis was a kind gift from
S. Paludan, Aarhus University, Denmark. For in vivo infections, the T3SA+ strain
was used as it readily binds to sialic acid and enhances reovirus infection through
adhesion to the cell surface. Reovirus T3SA+ was generated by reassortment of
reovirus strains T1L and type 3 clone 44-MA. Virions were purified as described.
IAV strain A/Puerto Rico/8/1934 H1N1 was provided by T. Muster (University of 
Vienna, Austria), and EMCV was obtained from I. Kerr. 

Nucleic acid preparations. Reovirus genomic RNA was extracted from purified 
viral particles using TRIzol LS (Life Technologies). L-A virus transcripts were
generated by in vitro transcription using purified L-A virions as described.
Electrophoresis using 0.8% agarose gels in Tris/borate/EDTA buffer (89 mM Tris, 89 mM
boric acid, 2 mM EDTA, pH 8.3) was used to separate reovirus L, M, S, and <S
segments and L-A virus genomes from transcripts. Bands were visualized following
ethidium bromide (Sigma-Aldrich) staining using an ultraviolet transilluminator
and a Dimage Xt digital camera (Minolta, Osaka, Japan). The 1 kb Plus DNA ladder
from Life Technologies was used as a reference. RNA segments were purified from
gels by adding one volume of UltraPure phenol (Life Technologies) per volume of
gel and precipitated using ethanol. For mass spectrometry analysis, reovirus
genome segments were purified by differential LiCl precipitation. Single-stranded
RNA was removed by addition of 2 M LiCl, incubated overnight at 4 °C and
centrifuged at 16,000 r.c.f. for 20 min. Subsequently, long dsRNA was precipitated by
addition of LiCl to 4 M final concentration. The final supernatant (4 M LiCl), which
contains small RNAs, was discarded. Purity and integrity of dsRNA preparations
were assessed using agarose gel electrophoresis followed by ethidium bromide staining.

For the isolation of RNA from reovirus-infected cells, one 80% confluent 145 cm²
plate of L929 cells was infected at a multiplicity of infection (MOI) of 1
plaque-forming unit (PFU) per cell. Forty-eight hours later, RNA was isolated using
TRIzol (Life Technologies) according to the manufacturer’s protocol. A similar
protocol was used for the isolation of RNA from IAV- or EMCV-infected Vero cells
at 24 h post-infection. RNA was used at 800 ng ml⁻¹ for reporter assays.

ppp-IVT-RNA99nts, which corresponds to the first 99 nucleotides of the neomycin-
resistance marker, was prepared using in vitro transcription as described and used
at concentrations of 200-500 ng ml⁻¹. ppp-IVT-RNA933nts corresponds to 933
nucleotides of the Renilla luciferase coding sequence. The double-stranded
ppp-IVT-RNA933nts was generated by annealing sense and anti-sense ppp-IVT-RNA933nts.
To synthesize p-, pp-, and ppp-IVT-RNA25nts, DNA oligonucleotides
sense (5'-AAAGGATCCATTTAGGTGACACTATAGACACACACACACACA 
CACACTTTCTCGAGAAA-3’) and antisense (5’-TTTCTCGAGAAAGTGTGTG 
TGTGTGTGTGTGTCTATAGTGTCACCTAAATGGATCCTTT-3’) were annealed 
and cloned into pcDNA3.1/V5-His-TOPO. The resulting plasmid was digested with 
XhoI and BglII (New England Biolabs, Ipswich, MA) and gel-purified (QIAquick
Gel Extraction kit, QIAGEN). The in vitro transcription was carried out overnight
at 37 °C with Sp6 RNA polymerase (Sp6 MEGAscript kit, Ambion, Life Technologies)
using GMP, GDP, or GTP (purchased from Sigma-Aldrich (St. Louis, MO)
and Carbosynth (Compton, United Kingdom)). After DNaseT1 treatment (Ambion)
and size-exclusion purification (Illustra microspin G25 column; GE Healthcare), the
RNA was purified using UltraPure phenol:chloroform:isoamylalcohol (25:24:1)
(Life Technologies) and precipitated using ethanol. The purified RNA was annealed
to a synthetic anti-sense RNA oligonucleotide (5′-GAAAGUGUGUGUGUGUG
UGUGUGUC-3′; Sigma-Aldrich) by heating for 5 min at 65 °C, followed by cooling
to room temperature before storage at −20 °C.
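
The oligonucleotide design described above can be sanity-checked in a few lines. The following Python sketch is an illustration only and not part of the original protocol: the function and variable names are hypothetical, the sequences are copied from the text, and the Sp6 promoter string is the commonly cited consensus, stated here as an assumption. It checks that the antisense DNA oligonucleotide is the reverse complement of the sense one, and locates the stretch of the sense strand that would base-pair with the synthetic anti-sense RNA oligonucleotide.

```python
# Illustrative consistency check; names are hypothetical, sequences are from the text.
SENSE_DNA = (
    "AAAGGATCCATTTAGGTGACACTATAGACACACACACACACA"
    "CACACTTTCTCGAGAAA"
)
ANTISENSE_DNA = (
    "TTTCTCGAGAAAGTGTGTG"
    "TGTGTGTGTGTGTCTATAGTGTCACCTAAATGGATCCTTT"
)
ANTISENSE_RNA = "GAAAGUGUGUGUGUGUG" "UGUGUGUC"

# Assumption: commonly cited Sp6 promoter consensus; transcription is taken
# to start at its final G.
SP6_PROMOTER = "ATTTAGGTGACACTATAG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(_COMPLEMENT)[::-1]


def paired_region_start(rna_oligo: str, dna: str) -> int:
    """Index in `dna` of the region complementary to `rna_oligo`, or -1."""
    target = reverse_complement(rna_oligo.replace("U", "T"))
    return dna.find(target)


if __name__ == "__main__":
    print("DNA oligos are reverse complements:",
          reverse_complement(SENSE_DNA) == ANTISENSE_DNA)
    start = paired_region_start(ANTISENSE_RNA, SENSE_DNA)
    print("Paired region starts at index", start,
          "and spans", len(ANTISENSE_RNA), "nucleotides")
```

Under these assumptions the paired region begins at the last G of the promoter, which is consistent with a transcript whose 5′ end is base-paired to the anti-sense oligonucleotide.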

5′ppp-RNA24mer (GACGCUGACCCUGAAGUUCAUCUU) was synthesized chemically
as described. 5′pp-RNA24mer was generated by incubating 5′ppp-RNA24mer with
vaccinia virus capping enzyme (Epicentre, Illumina, Madison, WI) for 3 h at 37 °C
in the absence of GTP and S-adenosyl methionine, otherwise according to the
manufacturer’s protocol. 5′p-RNA24mer and AS RNA were obtained from Biomers
(Ulm, Germany) or Eurogentec (Seraing, Belgium). Quality control was performed by
mass spectrometry as described. MALDI ToF characterization of 5′ppp-RNA24mer and
5′pp-RNA24mer was performed and spectra were measured using a Bruker Biflex III with
linear detection mode and a proprietary Sequenom matrix by Metabion (Martinsried,
Germany). For stimulation assays, RNAs were annealed with non-modified
oligonucleotides with complementary sequence.
Enzymatic treatment of RNA. CIP, SAP (New England Biolabs) and RNA 5’- 
polyphosphatase (Epicentre) were used according to the manufacturer’s instructions. 
For Terminator nuclease (Epicentre) digestion, RNA was denatured for 3 min at
98 °C and immediately cooled on ice (melt and snap cool), after which Terminator
N buffer and 1 µl of Terminator nuclease with or without 0.5 µl vaccinia virus
capping enzyme were added. The mixture was incubated for 30 min at 30 °C and another
30 min at 37 °C. With all enzymatic treatments, control reactions omitting enzymes
were carried out in parallel. RNA samples were recovered by extraction with
phenol:chloroform:isoamylalcohol or TRIzol. Nucleic acid pellets were resuspended in
RNase/DNase-free water (Ambion), and concentrations were measured using a
Nanodrop (Thermo Scientific). ReoT3D vRNA samples treated with or without
Terminator N were spiked with 10 µg of human RNA to ensure efficient precipitation
and to quantify precipitation efficiency (qPCR of GAPDH, data not shown). For
RNase T2 digestion of reovirus genome, 40 µg of RNA was diluted to 0.3 µg µl⁻¹
and denatured by heating for 3 min to 95 °C. Subsequently, the RNA was incubated
with 150 U of RNase T2 (MoBiTec GmbH, Göttingen, Germany) in 125 mM NH4Ac
for 4 h at 37 °C. To ensure a complete reaction, the digest was then heated for 60 s
at 95 °C, another 150 U of RNase T2 were added and incubation was continued for
1 h at 37 °C. Quantitative digestion was verified by agarose gel electrophoresis and
the digestion products were analysed by ESI-LC-MS.
Detection of IFN stimulatory activity. The IFN-β luciferase promoter reporter was
employed as described. Cells (1 X 10°-2.5 X 10°) were plated in 0.5 ml of antibiotic-
free medium and transfected one day later with a mix of 0.125 µg of IFN-β luciferase
promoter reporter (F-luc) and 0.025 µg of Renilla luciferase control (R-luc) using
Lipofectamine 2000 (Life Technologies). After 8 h, various concentrations of samples
(for example, IVT-RNA) were transfected into cells using Lipofectamine 2000.
Luciferase activity was quantified 24 h later using the Dual-Luciferase Reporter Assay
System from Promega (Fitchburg, WI). Firefly luciferase activity was normalized
to Renilla luciferase, and fold inductions were calculated relative to a control
transfection with water only (F-luc/R-luc). L, M, S and <S reovirus segments were
used at 50, 10, 2 and 0.4 ng ml⁻¹ in reporter assays (Fig. 1).
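
The normalization described above (firefly over Renilla, then relative to the water-only control) amounts to a ratio of ratios. A minimal Python sketch follows, assuming raw luminescence readings are available per well; the function name and the example readings are hypothetical and are not taken from the study.

```python
def fold_induction(firefly: float, renilla: float,
                   control_firefly: float, control_renilla: float) -> float:
    """Fold induction of the IFN-beta reporter relative to a water-only control.

    Each well is first normalized to its own Renilla signal (F-luc/R-luc) to
    correct for transfection efficiency and cell number; the sample ratio is
    then divided by the control ratio.
    """
    sample_ratio = firefly / renilla
    control_ratio = control_firefly / control_renilla
    return sample_ratio / control_ratio


if __name__ == "__main__":
    # Hypothetical readings in arbitrary luminescence units.
    print(fold_induction(firefly=5000, renilla=1000,
                         control_firefly=500, control_renilla=1000))
```

Dividing by the Renilla signal before comparing wells keeps differences in transfection efficiency from being mistaken for reporter induction.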
For assays using MEFs, 5 X 10* cells per well were plated into wells of 24-well
plates and transfected with test or control RNAs using Lipofectamine 2000. After
incubation overnight (16 h), murine IFN-α (multiple subtypes) in culture
supernatants was quantified by ELISA as described, or MEF RNA was extracted. A similar
protocol was employed for assays using bone marrow-derived DCs. Where indicated,
cells were pre-treated for 24 h with 500 units ml⁻¹ of IFN-A/D (PBL Assay Science)
as a means to upregulate RLR expression. DCs were transfected with
200 ng ml⁻¹ of reoT1L or reoT3D vRNA or 100 ng ml⁻¹ of total L-A RNA.

For assays using PBMCs, 4 X 10° cells were cultured in 96-well plates for
stimulation experiments. To inhibit TLR7/8 activity, cells were pre-incubated with
2.5 µg ml⁻¹ chloroquine for 30 min before transfection of RNA using Lipofectamine
2000 (Life Technologies). Human IFN-α levels in culture supernatants 20 h
post-transfection were determined by ELISA as described.

Quantitative PCR for ifnb1 and ifit1. For real-time quantitative (q)PCR of cell- 
culture samples, total RNA was isolated using the RNeasy Mini kit (Qiagen, Hilden, 
Germany) combined with QIAshredder (Qiagen) and RNase-free DNase I treatment
(Qiagen). Mouse tissues (intestine and MLN) were first homogenized using stain- 
less steel beads and the TissueLyzer II (Qiagen). RNA was extracted using TRIzol, 
treated with DNase I, and purified using the RNeasy Mini kit (Qiagen). To measure 
ifnb1 and ifit1 expression, cDNA was prepared following instructions in the
SuperScript II kit (Life Technologies). Real-time PCR reactions were carried out using an
ABI 7500 Fast or ViiA 7 real-time PCR system with TaqMan universal master mix
and the following primers: ifnb1 (Mm00439546_s1), ifit1 (Mm00515153_m1), and
gapdh (4352932E) (Applied Biosystems, Life Technologies). Relative expression (RE)
was determined using the ABI 7500 Real-Time PCR System (Applied Biosystems)
and analysed by the comparative Ct method using the SDS v1.3.1 Relative
Quantification Software.
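
The comparative Ct calculation can be summarized in a short sketch. The Python below is an illustration under the usual textbook assumption of perfect amplification efficiency (product doubles each cycle); it is not the SDS software's implementation, and the function names and Ct values are hypothetical.

```python
def relative_expression(ct_target: float, ct_reference: float) -> float:
    """Expression of a target gene relative to a reference gene (e.g. gapdh).

    Assumes 100% PCR efficiency, so each cycle of difference in threshold
    cycle (Ct) corresponds to a twofold difference in template.
    """
    delta_ct = ct_target - ct_reference
    return 2.0 ** (-delta_ct)


def fold_change(ct_target_sample: float, ct_ref_sample: float,
                ct_target_control: float, ct_ref_control: float) -> float:
    """Relative expression in a sample divided by that in a control sample."""
    return (relative_expression(ct_target_sample, ct_ref_sample)
            / relative_expression(ct_target_control, ct_ref_control))


if __name__ == "__main__":
    # Hypothetical Ct values: ifit1 at 25 cycles, gapdh at 20 cycles.
    print(relative_expression(25.0, 20.0))
    print(fold_change(22.0, 20.0, 25.0, 20.0))
```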

Reovirus replication assay and quantitative PCR. For in vitro replication assays, 
2 X 10° DCs or 3 X 10° MEFs were seeded into wells of 6-well plates (Corning, 
Corning, NY) and adsorbed in triplicate with reovirus T3D at an MOI of 0.1 PFU 
per cell for 1 h at room temperature in serum-free medium, washed once with PBS, 
and incubated in serum-containing medium for various intervals. Twenty-four or 
forty-eight hours post-infection, RNA from each sample was purified as described
above and quantified by RT-qPCR using a modification of a previously described
protocol. Following in vitro and in vivo assays, reovirus S4 vRNA was quantified
using 1-4 µg of total RNA extract. Forward (S4 83F, 5′-CGCTTTTGAAGGTCGT
GTATCA-3′) primer was used for reverse transcription before reverse (S4 153R,
5′-CTGGCTGTGCTGAGATTGTTTT-3′) primer was added for qPCR amplification.
The S4-specific fluorogenic probe used was 5′-dFAM-AGCGCGCAAGAG
GGATGGGA-BHQ-1-3′ (Biosearch Technologies, Petaluma, CA). After denaturing
RNA for 3 min at 95 °C, reverse transcription was performed for 15 min at 50 °C
and terminated by incubation for 3 min at 95 °C. Subsequently, 40 cycles of qPCR
were performed (95 °C for 15 s; 60 °C for 30 s). Reovirus S4 copy numbers were
calculated relative to a standard curve prepared using tenfold dilutions of purified
reovirus T3D vRNA as template RNA. The final S4 RNA copy number was
normalized to the total sample RNA used per reaction. The S4 segment threshold cycle (Ct)
value of each sample was also used to determine the relative expression of S4 to that
of gapdh.
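
Quantification against a tenfold dilution series can be sketched as a straight-line fit of Ct against log10(copy number), followed by inversion for unknown samples. The Python below is a minimal illustration with hypothetical function names and synthetic dilution data; it is not the authors' analysis code.

```python
import math


def fit_standard_curve(copies, ct_values):
    """Least-squares fit of Ct = slope * log10(copies) + intercept."""
    xs = [math.log10(c) for c in copies]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def copies_from_ct(ct, slope, intercept):
    """Invert the standard curve to estimate copy number from a sample Ct."""
    return 10 ** ((ct - intercept) / slope)


def copies_per_microgram(ct, slope, intercept, total_rna_ug):
    """Copy number normalized to the total RNA (in micrograms) per reaction."""
    return copies_from_ct(ct, slope, intercept) / total_rna_ug


if __name__ == "__main__":
    # Synthetic tenfold dilution series (hypothetical values).
    standards = [1e3, 1e4, 1e5, 1e6, 1e7]
    cts = [40 - 3.5 * math.log10(c) for c in standards]
    slope, intercept = fit_standard_curve(standards, cts)
    print(slope, intercept)
    print(copies_per_microgram(26.0, slope, intercept, total_rna_ug=2.0))
```

A slope close to −3.32 would correspond to a doubling of product per cycle; the value of −3.5 used here is only a placeholder for the synthetic data.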

Reovirus strand-specific reverse transcription and quantitative PCR. Strand- 
specific reverse transcription was carried out by mixing reoT3D vRNA with reverse 
primers (positive-strand-specific reverse transcription) or forward primers (negative- 
strand-specific reverse transcription), as indicated below, heating to 98 °C and 
snap-cooling on ice before incubating for 5 min at 60 °C. Reverse transcription 
was performed at 42 °C for 30 min (Revert AID, Thermo Scientific). Subsequently, 
40 cycles of qPCR were performed (95 °C for 15s; 60°C for 20s; 72 °C for 20s) 
using EvaGreen qPCR master mix (Biotium, Hayward, USA). Primers used were: 
1A T3D L1 75-5′, CACTGACCAATCGAATGACG;
1A′ T3D L1 262-3′, GCACACGGTTTAGAGCATCA;
1B T3D L1 3525-5′, TGTGCAATTAGCCAGAGTGG;
1B′ T3D L1 3718-3′, TCGCAGTCATTACCATTCCA;
2A T3D L2 31-5′, GGTGAGACTTGCAGACTCGTT;
2A′ T3D L2 130-3′, CCCCGGATTAGCATCTAGG;
2B T3D L2 3780-5′, T@CTACCTCAAGATTGGGATG;
2B′ T3D L2 3902-3′, GCTCACGAGGGACAGTGAG;
3A T3D L3 31-5′, GAAGACAAAGGGCAAATCCA;
3A′ T3D L3 130-3′, GCCAGCCTTATTGTTTTGCTT;
3B T3D L3 3741-5′, CGCAGATACAACTGCCTGAA;
3B′ T3D L3 3856-3′, TTGGGAGGATGAGGATCAAG;
4A T3D M1 16-5′, GGCTTACATCGCAGTTCCTG;
4A′ T3D M1 120-3′, GAAACGTCATTCGCGTCAG;
4B T3D M1 2178-5′, GAGCTGCATACAGTGCGAGA;
4B′ T3D M1 2298-3′, GCGCGTACGTAGTCTTAGCC;
5A T3D M2 19-5′, ACTCTGCAAAGATGGGGAAC;
5A′ T3D M2 120-3′, CGATGGTACAGCGGTAGATG;
5B T3D M2 2089-5′, AATCGTCTAATCGCCGAGTG;
5B′ T3D M2 2199-3′, ATTTGCCTGCATCCCTTAAC;
6A T3D M3 20-5′, TGGCTTCATTCAAGGGATTC;
6A′ T3D M3 138-3′, ATCCACAGACGGAGTGAAGG;
6B T3D M3 2129-5′, CAGCTGATGGTGTTGCTGAC;
6B′ T3D M3 2228-3′, CGGGAAGGCTTAAGGGATTA;
7A T3D S1 18-5′, TCCTCGCCTACGTGAAGAAG;
7A′ T3D S1 163-3′, GGGTGATCCGGAGGATAGTA;
7B T3D S1 1268-5′, AGCAGTGGCAGGATGGAGTA;
7B′ T3D S1 1374-3′, GAAACTACGCGGGTACGAAA;
8A T3D S2 53-5′, GGTTTGGTGGTCTGCAAAAT;
8A′ T3D S2 178-3′, TAGCTAAACCCCTCCCAAGG;
8B T3D S2 1218-5′, GCAATGGGGACGAGGTAATA;
8B′ T3D S2 1319-3′, GTCAGTCGTGAGGGGTGTG;
9A T3D S3 19-5′, GTCGTCACTATGGCTTCCTCA;
9A′ T3D S3 118-3′, AGGACCGCAGCATGACATA;
9B T3D S3 1083-5′, TG€ACGCCAGTGATGCTAGAC;
9B′ T3D S3 1186-3′, TCACCCACCACCAAGACAC;
10A T3D S4 54-5′, GGTCATCAGGTCGTGGACTT;
10A′ T3D S4 153-3′, CTGGCTGTGCTGAGATTGTT;
10B T3D S4 1039-5′, CTCCTGCTGCTCTCACAATG;
10B′ T3D S4 1148-3′, CTGTGAAGATGGGGGTGTTT.
Mice and in vivo infection studies. All mice used in this study were bred in spe- 
cific pathogen-free (SPF) conditions by the Cancer Research UK - Biological Resources 
Unit. Experiments were performed in accordance with national and institutional 
guidelines for animal care and approved by the Institutional Animal Ethics
Committee Review Board, Cancer Research UK. Wild-type C57BL/6J mice were
purchased from Charles River Laboratories (Wilmington, MA). The C57BL/6 Mavs−/−
(also known as Cardif−/−) mice were provided by J. Tschopp (deceased). RIG-I
(Ddx58) B6;129X1(ICR)-Ddx58 and MDA5 (Ifih1) B6;129X1-Ifih1 mice
were obtained from S. Akira (Japan) and backcrossed once to C57BL/6J mice. For
infection studies, 6-8-week-old male or female mice were inoculated perorally with
10° PFU of reovirus T3SA+ in 200 µl borate-buffered saline (0.13 M NaCl, 0.25 mM
CaCl2, 1.5 mM MgCl2 × 6H2O, 20 mM H3BO3, and 0.15 mM Na2B4O7 × 10H2O)
containing 5 g l⁻¹ of gelatine (Sigma-Aldrich). Forty-eight hours post-infection,
mice were euthanized, and organs (intestine and MLNs) were harvested and
processed. The investigator was blinded when processing and assessing outcome by giving
a unique number to each animal, which was independent of genotype. No
randomization to experimental groups was required for these studies and a minimum
sample size of n = 5 per group was chosen.

RIG-I immunoprecipitation. For immunoprecipitation (IP) studies using stable 
Flag-RIG-I clones, one 80% confluent 145 cm² plate of HEK293 cells stably
expressing Flag-RIG-I was washed in phosphate-buffered saline (PBS) and infected with
reoT1L at an MOI of 100 PFU per cell in FCS-free medium. Cells were incubated
for 1 h at 37 °C and 10% CO₂ before adding an equivalent volume of medium
supplemented to contain 20% FCS. Four hours later, cells were washed with ice-cold
PBS and lysed in 4 ml of ice-cold buffer C (0.5% NP40, 20 mM Tris-HCl pH 7.5,
150 mM NaCl, 2.5 mM MgCl2, complete protease inhibitor (Roche Applied Sciences),
0.1 U ml⁻¹ RNasin (Promega)). Lysates were incubated on ice for 30 min and
centrifuged at 20,000g for 15 min to remove cell debris. Resulting supernatants were
divided equally and incubated with anti-Flag M2 antibody (Sigma-Aldrich) or
control mIgG1 (BD Biosciences, Franklin, NJ) for 1.5-2 h at 4 °C on a rotating wheel
before adding Gamma Bind Plus Sepharose beads (GE Healthcare). Two hours
later, beads were collected by centrifugation and washed five times for 2 min with
1-1.5 ml of buffer C. Bead samples were divided for RNA extraction with UltraPure
phenol:chloroform:isoamylalcohol (25:24:1) (Life Technologies) or subjected to
immunoblotting. RNA isolated from beads was resuspended in 20 µl of RNase-
free water and transfected into HEK293 cells expressing the IFN-β reporter. For IP
using recombinant RIG-I, 1 µg of purified protein was incubated with different RNA
in lysis buffer C for 2 h at 4 °C on a rotating shaker. The rest of the IP was performed
using the protocol employed for the RIG-I IP from infected cells. 

Alpha Screen RIG-I-binding assay. The binding affinity of RNA for (His6)-Flag-
tagged wild-type RIG-I was determined as described using an amplified
luminescent proximity homogeneous assay (AlphaScreen; Perkin Elmer, Waltham, MA).
Purified (His6)-Flag-RIG-I was incubated with increasing concentrations of
biotinylated RNA (non-triphosphorylated antisense RNA is 5′ biotinylated) for 1 h at
37 °C in buffer (50 mM Tris/pH 7.4, 100 mM NaCl, 0.01% Tween20, 0.1% BSA)
and subsequently incubated for 30 min at 25 °C with (His6)-Flag-RIG-I-binding
nickel-chelate acceptor beads (Perkin-Elmer) and biotin-RNA-binding streptavi- 
din donor beads (Perkin Elmer). 

Liquid chromatography-mass spectrometry (LC-MS) analysis. Samples of GMP, 
GDP, and GTP were analysed by LC-MS using an Agilent 1100 LC-MSD. Analysis 
was carried out using a Zorbax Eclipse XDB-C8 Rapid Resolution HT 3.0 X 50 mm 
1.8 micron column. Buffer A is 1% acetonitrile and 0.08% trifluoroacetic acid in 
milliQ water, and Buffer B is 90% acetonitrile and 0.08% trifluoroacetic acid in milliQ 
water. Flow rate was 0.425 ml min⁻¹, and the gradient was 0%-40% Buffer B over
8 min. The MS was conducted with positive polarity, fragmentor voltage was 170,
drying gas flow 12 l min⁻¹, drying gas temperature 350 °C, and nebuliser pressure
40 pounds per square inch above atmospheric pressure. Mass spectra were registered
in full-scan mode (m/z 200 to 3,000, step size 0.15). RNA digests were analysed by
electrospray ionization (ESI)-LC-MS performed by Axolabs GmbH (Kulmbach,
Germany) using a Dionex Ultimate3000 RS system coupled to a Bruker maXis Q-
ToF mass spectrometer. The samples were analysed with an improved version of
the protocol established for ribonucleotide digestion analysis. Analysis of 4 pmol
and 50 pmol of an equimolar solution of chemically synthesized pp-RNA24mer and
ppp-RNA24mer treated with RNase T2 served as control. Characterization of
reovirus genome RNA was performed with 50 µl (13 µg) of the RNase T2 digest.
Poly(I:C) studies. Poly(I:C) was obtained from Amersham (GE Healthcare Life 
Sciences) and treated or not with the dsRNA-specific endoribonuclease, RNase III 
from Ambion (Life Technologies) as specified by the manufacturer. Digestion was 
performed at room temperature and halted at 1 min or 5 min following enzyme addi- 
tion using 125 mM EDTA. Poly(I:C) samples were purified using UltraPure phenol: 
chloroform:isoamylalcohol (25:24:1) (Life Technologies) and precipitated using 
ethanol before being treated with CIP and re-purified as done following RNase III 
treatment. A fraction of the samples were electrophoresed in 0.8% agarose gels and 
visualized using ethidium bromide, whereas another fraction was transfected into 
SV40 large T antigen-immortalized MDA5−/− or RIG-I−/− MEFs. Cells (5 X 10“) in
aliquots of 0.5 ml antibiotic-free medium were placed into wells of 24-well plates
and transfected one day later. For the IFN-β luciferase promoter reporter assay,
cells were transfected with a mix of 0.3 µg of IFN-β luciferase promoter reporter
(F-luc) and 0.05 µg of Renilla luciferase control (R-luc) using Lipofectamine 2000
(Invitrogen). After 8 h, various concentrations of samples were transfected using
Lipofectamine 2000. Luciferase activity was quantified 24 h later using the Dual-
Luciferase Reporter Assay System from Promega. Firefly luciferase activity was
normalized to Renilla luciferase, and fold inductions were calculated relative to a control
transfection using water only (F-luc/R-luc). For RT-qPCR analysis of ifit1 levels, cells
were transfected with various concentrations of samples using Lipofectamine 2000.
Samples were harvested 16 h later, and RNA was extracted as described above.


Statistical analysis. Statistical analyses were performed using unpaired, two-tailed, 
Student’s t-tests or two-way ANOVAs. P values of less than 0.05 were considered 
statistically significant. 
Extended Data Figure 1 | RNA from reovirus and L-A virus induces a
RIG-I-dependent IFN response that requires 5′-diphosphates. a, Total RNA
purified from reoT1L particles (vRNA) was treated or not with calf intestinal
phosphatase (± CIP). The RNA was analysed by gel electrophoresis to verify its
integrity (left panel) or transfected into HEK293 cells to determine its capacity
to stimulate the IFN-β promoter using a reporter assay (right panel). b, L, M,
and S reoT1L genome segments were isolated by gel fractionation and treated or
not with CIP. An aliquot of the treated samples was electrophoresed in a 0.8%
agarose gel to validate RNA integrity (left panel), whereas another was
transfected into HEK293 cells to determine its capacity to stimulate the IFN-β
promoter using a reporter assay (right panel). c, d, Total L-A RNA as well as
gel-purified L-A genomes and transcripts (as in Fig. 1i) were transfected into
RIG-I+/− or RIG-I−/− MEFs (c) and MDA5+/+ or MDA5−/− DCs (d). After incubation
for 16 h, the relative expression (RE) of ifit1 over gapdh (c) or murine IFN-α
levels (d) were determined. Water and ppp-IVT-RNA, poly(dA:dT), or
RNA isolated from Vero cells infected with encephalomyocarditis virus
(Vero-EMCV-RNA) were included as controls (* = none detected). All
experiments were performed at least twice; one representative experiment
is shown.
Extended Data Figure 2 | RIG-I associates with stimulatory RNA following
reovirus infection. This experiment was conducted exactly as in Fig. 2a but
using strain reoT3D.
Extended Data Figure 3 | Characterization of guanosine sources and
IVT-RNA25nts. a, Representative LC-MS spectra of GMP, GDP, and GTP
sources used for the preparation of IVT-RNAs in Fig. 3. Asterisks indicate
the expected mass-to-charge ratio (m/z) of the different guanosines.
b, IVT-RNA25nts were generated as depicted in Fig. 3a using GMP, GTP or
GMP spiked with GTP (GMP + 10% GTP) before being annealed to AS RNA
and tested using the IFN-β promoter reporter assay following transfection into
HEK293 cells. c, Spectra of 5′pp-RNA24mer and 5′ppp-RNA24mer following
MALDI ToF characterization (a.i., absolute intensity). Ions with two charges
(m2+) appear exactly at half the expected (m+) mass/charge (m/z) ratio.
Extended Data Figure 4 | Phosphatase treatment of poly(I:C) affects
RIG-I but not MDA5-dependent IFN-responses. a, Schematic representation
of inosinic acid or cytidylic acid homopolymer synthesis from inosine
5′-diphosphate or cytidine 5′-diphosphate through the action of
polynucleotide phosphorylase, which when annealed form the synthetic
dsRNA analogue poly(I:C). Whether the synthesized polynucleotides carry a
5′ di- or monophosphate or a mixture of both is unclear. b, IFN-pre-treated
MDA5−/− or RIG-I−/− immortalized MEFs were transfected with poly(I:C)
± CIP. IFN induction was quantified 16 h later using an IFN-β promoter
reporter assay. c, Poly(I:C) was first cleaved with RNase III for 1 or 5 min
before being treated or not with CIP (+/−). Samples were subjected to gel
electrophoresis to verify digestion (left panel) or transfected into
IFN-pre-treated MDA5−/− MEFs (right panel). Cells were harvested 16 h
post-transfection, and IFN-responses were assessed by RT-qPCR for ifit1 expression.
Water and ppp-IVT-RNA were included as controls. RE, relative
expression. All experiments were performed at least twice; one representative
experiment is shown. For PCR data, the mean (± s.d.) of triplicate technical
replicates is shown.
Stochasticity of metabolism and growth at the single-cell level


Daniel J. Kiviet, Philippe Nghe, Noreen Walker, Sarah Boulineau, Vanda Sunderlikova & Sander J. Tans


Elucidating the role of molecular stochasticity in cellular growth is
central to understanding phenotypic heterogeneity and the stability
of cellular proliferation. The inherent stochasticity of metabolic
reaction events should have negligible effect, because of averaging
over the many reaction events contributing to growth. Indeed, metabolism
and growth are often considered to be constant for fixed
conditions. Stochastic fluctuations in the expression level of
metabolic enzymes could produce variations in the reactions they catalyse.
However, whether such molecular fluctuations can affect growth
is unclear, given the various stabilizing regulatory mechanisms,
the slow adjustment of key cellular components such as ribosomes,
and the secretion and buffering of excess metabolites. Here we
use time-lapse microscopy to measure fluctuations in the instantaneous
growth rate of single cells of Escherichia coli, and quantify
time-resolved cross-correlations with the expression of lac genes and
enzymes in central metabolism. We show that expression fluctuations
of catabolically active enzymes can propagate and cause growth
fluctuations, with transmission depending on the limitation of the
enzyme to growth. Conversely, growth fluctuations propagate back
to perturb expression. Accordingly, enzymes were found to transmit
noise to other unrelated genes via growth. Homeostasis is promoted
by a noise-cancelling mechanism that exploits fluctuations in the
dilution of proteins by cell-volume expansion. The results indicate
that molecular noise is propagated not only by regulatory proteins
but also by metabolic reactions. They also suggest that cellular
metabolism is inherently stochastic, and a generic source of phenotypic
heterogeneity.

To investigate the dynamics of cellular growth, we followed individual
E. coli cells growing on different nutrients. Among them was the
synthetic sugar lactulose, which is imported and catabolized by the
LacY and LacZ enzymes like its analogue lactose, but unlike lactose
does not induce lac operon expression (Fig. 1a). Mixtures of lactulose
and the gratuitous inducer isopropyl-β-D-thiogalactoside (IPTG) thus
allowed us to vary the mean lac expression level independently and
hence to explore different regimes of noise transmission. We determined
the instantaneous growth rate μ(t) of individual cells within microcolonies
at sub-cell-cycle resolution for various growth conditions, using
time-lapse microscopy at high acquisition rates and automated image
analysis (Supplementary Information). We found that μ(t) varied considerably
in time, both within one cell cycle and between different cell cycles
(Fig. 1b, c and Extended Data Fig. 1), with noise intensities (standard
deviation over the mean) ranging between 0.2 and 0.4 (Fig. 1d). Consistently,
Figure 1 | Growth rate variability in single E. coli cells. a, Schematic diagram
of the studied system. Lactulose is metabolized by the lac enzymes, but does
not induce lac expression. Mean lac expression can hence be varied
independently by the inducer IPTG. GFP is fused transcriptionally in the lac
operon (Extended Data Table 2). b, Aligned phase-contrast images for two
lineages. Microcolonies were grown on polyacryl pads (0.1% lactulose and
200 µM IPTG) for eight to nine generations. Up to 48 images were taken per
hour. Red line: cell boundary from image analysis. c, Instantaneous growth rate
μ(t) against time, determined by fitting exponentials to the cellular length. Four
lineages are coloured for clarity. Black bar, mean division time; light points,
division events. d, Top: histograms of μ values for different IPTG levels.
Bottom: noise intensity (standard deviation over the mean). e, Autocorrelation
function of μ(t) for low (4 µM, green), intermediate (6 µM, ochre) and high
(200 µM, brown) IPTG levels. For clarity, error bars denoting the standard
deviation are indicated only for a fraction of the points. Black lines: exponential
fits that provide the correlation time. Correlation functions were determined
along the branched lineages (Extended Data Fig. 8). f, Graph of μ(t) correlation
time versus mean doubling time. Colours are as in e; black points are for
growth on defined rich, lactose, succinate and acetate (in order of increasing
doubling time). g-i, As c, e and f, but for the fluorescence intensity reporting
for E(t) within single cells. Protein concentrations were determined by the
mean fluorescence per unit area (Extended Data Fig. 1e-g).
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the growth rates of sister cells were significantly correlated (Extended 
Data Fig. 2). We found that the typical timescales of the fluctuations 
were somewhat smaller than the mean cellular doubling time, as quan- 
tified by the autocorrelation functions R,,,,(t) (Fig. le, f). Such a scaling 
with doubling time is typical for protein concentration fluctuations”. 
Thus, the data indicated randomly fluctuating growth limitations, and 
suggested they could be caused by concentration fluctuations of cellular 
components. 
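
The two summary statistics used above, the noise intensity and the
autocorrelation function with its exponential decay time, can be sketched as
follows. This is a minimal illustration on a synthetic trace, not the analysis
pipeline of the study: the function names, the exponentially correlated test
signal and the cut-off (`floor`) used for the exponential fit are all
assumptions made for the example.

```python
import numpy as np

def noise_intensity(x):
    """Noise intensity: standard deviation divided by the mean."""
    x = np.asarray(x, dtype=float)
    return x.std() / x.mean()

def autocorrelation(x, max_lag):
    """Normalized autocorrelation R(k) for lags k = 0..max_lag (in samples)."""
    d = np.asarray(x, dtype=float)
    d = d - d.mean()
    var = np.mean(d * d)
    n = len(d)
    return np.array([np.mean(d[: n - k] * d[k:]) / var for k in range(max_lag + 1)])

def correlation_time(acf, dt, floor=0.2):
    """Decay time from a log-linear fit to the part of the ACF above `floor`."""
    below = np.where(acf < floor)[0]
    n = below[0] if len(below) else len(acf)
    lags = np.arange(n) * dt
    slope = np.polyfit(lags, np.log(acf[:n]), 1)[0]
    return -1.0 / slope

def synthetic_growth_rate(n, dt, tau, mean=1.0, cv=0.3, seed=0):
    """Hypothetical test trace: exponentially correlated fluctuations around `mean`."""
    rng = np.random.default_rng(seed)
    a = np.exp(-dt / tau)                      # per-step memory of the process
    kicks = rng.standard_normal(n) * np.sqrt(1.0 - a * a)
    x = np.empty(n)
    x[0] = rng.standard_normal()
    for i in range(1, n):
        x[i] = a * x[i - 1] + kicks[i]         # unit-variance correlated noise
    return mean + cv * mean * x

if __name__ == "__main__":
    dt = 1.0 / 48.0                            # 48 images per hour, as in Fig. 1
    mu = synthetic_growth_rate(200_000, dt, tau=1.0)
    acf = autocorrelation(mu, 150)
    print(noise_intensity(mu), correlation_time(acf, dt))
```

On real single-cell data the trace is much shorter and branched along lineages,
so the estimate of the correlation time is correspondingly noisier than in
this long synthetic example.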

To study the relation between growth and lac enzymes, we quantified
the fluctuations in the lac production rate p(t) and concentration
E(t) using green fluorescent protein (GFP) labelling (Fig. 1a, g-i and
Extended Data Fig. 1). We computed the cross-correlation functions
R_pμ(τ) and R_Eμ(τ), which indicate whether expression fluctuations
correlate with μ fluctuations occurring a time τ later, and thus inform on the
direction of transmission. Both R_pμ(τ) and R_Eμ(τ) showed positive
correlations regardless of the IPTG concentration (Fig. 2a, e-g). Their
shapes and symmetries did depend on IPTG, however. At low and
intermediate IPTG, R_Eμ(τ) was nearly symmetric around τ = 0 while
R_pμ(τ) was asymmetric with larger weight at τ > 0 (Fig. 2e, f and
Extended Data Fig. 3). This would indicate that p fluctuations on average
correlated more strongly with μ fluctuations that occur later. Such a
delay in μ is consistent with the idea that lac expression fluctuations
produce variations in lactulose catabolism, which in turn propagate
through the metabolic network and perturb growth.
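
How a cross-correlation function reveals the direction of transmission can be
illustrated with a small sketch. The code below is an assumption-laden toy, not
the estimator used in the study (which works along branched lineages): the
function names and the synthetic signals are hypothetical, and the delay is
imposed by hand so that the peak position can be checked.

```python
import numpy as np

def cross_correlation(a, b, max_lag):
    """R_ab(k): correlation between a(t) and b(t + k) for k = -max_lag..max_lag."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    a = a - a.mean()
    b = b - b.mean()
    n = len(a)
    norm = n * a.std() * b.std()
    lags = np.arange(-max_lag, max_lag + 1)
    r = np.empty(len(lags))
    for i, k in enumerate(lags):
        if k >= 0:
            r[i] = np.sum(a[: n - k] * b[k:]) / norm   # b lags behind a
        else:
            r[i] = np.sum(a[-k:] * b[: n + k]) / norm  # b leads a
    return lags, r

def make_delayed_pair(n=5000, delay=5, seed=1):
    """Hypothetical signals: 'mu' follows 'p' after `delay` samples, plus noise."""
    rng = np.random.default_rng(seed)
    p = rng.standard_normal(n)
    mu = np.roll(p, delay) + 0.5 * rng.standard_normal(n)
    return p, mu

if __name__ == "__main__":
    p, mu = make_delayed_pair()
    lags, r = cross_correlation(p, mu, 20)
    print("peak at lag", lags[np.argmax(r)])
```

With this sign convention a peak at positive lag means that fluctuations in
the first signal tend to precede those in the second, which is the reading
applied to R_pμ(τ) in the text.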

At high IPTG, R_Eμ(τ) displayed a positive peak at τ < 0 (Fig. 2g and
Extended Data Fig. 3). Thus, E fluctuations correlated more strongly
with μ fluctuations occurring earlier, which suggested backward
transmission from growth to expression. Such a growth-to-expression
coupling could be caused by specific regulatory interactions, or more
generally by growth fluctuations that cause variations in general
components that are required for transcription and translation. Overall, the
data suggested that noise not only propagated forward, from expression
to growth, but also backward, from growth to expression.

To determine whether back-and-forth transmission could explain
the correlations, we developed a stochastic model. A black-box approach
was followed, in which noise propagation is represented by
phenomenological transmission coefficients that do not specify molecular details
(Fig. 2b). Despite the circulating noise, the system could be decomposed
into distinct noise transmission modes, here termed the lac catabolism,
common noise and dilution modes (Fig. 2d). The cross-correlation
curves for all induction levels (Fig. 2e-g) were fitted jointly, using the
transmission strength from the common noise source to p as a single
free parameter (Fig. 2h-j).

The effects of induction could be explained by altered intensities of
the modes. At low and intermediate IPTG, the lac catabolism mode
was dominant, with lac noise causing up to 30% of the growth noise
(Extended Data Table 1). At higher IPTG this mode weakened because
of decreased transmission from E to μ. This decrease is plausible, as
catalysed reactions are less dependent on catalyst when the latter is
abundant, consistent with the observed relation between the mean E and μ
(Fig. 2c). On the other hand, the rather constant R_pμ(0) (Fig. 2e-g)
indicated that the common-noise mode had an almost fixed intensity for
all IPTG concentrations. To probe the generality of this mode further,
we made a number of genetic modifications. We found that it remained
active when we knocked out the lac repressor, changed the GFP position
within the operon, altered the type of fluorescent protein or used
an exogenous constitutive promoter (Extended Data Fig. 4a-d). These
data suggest that common noise transmits to expression in general, which
does not exclude additional coupling by specific regulatory interactions.
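
The argument that transmission from E to μ weakens when the enzyme is
abundant can be made concrete with the Monod form mentioned for Fig. 2c. The
sketch below assumes μ = μ_max·E/(K + E), for which the logarithmic
sensitivity d ln μ / d ln E equals K/(K + E); the parameter values and function
names are hypothetical and chosen only for illustration.

```python
def monod_growth(E, mu_max, K):
    """Mean growth rate as a saturating (Monod-type) function of enzyme level E."""
    return mu_max * E / (K + E)

def log_sensitivity(E, K):
    """d ln(mu) / d ln(E) for the Monod form, which equals K / (K + E)."""
    return K / (K + E)

def numerical_log_sensitivity(E, mu_max, K, rel_step=1e-6):
    """Central finite-difference check of the analytical sensitivity."""
    up = monod_growth(E * (1 + rel_step), mu_max, K)
    down = monod_growth(E * (1 - rel_step), mu_max, K)
    return (up - down) / (2 * rel_step * monod_growth(E, mu_max, K))

if __name__ == "__main__":
    K = 1.0  # hypothetical half-saturation level, arbitrary units
    for E in (0.1, 1.0, 10.0):
        # A 1% fluctuation in E changes mu by roughly this many percent.
        print(E, log_sensitivity(E, K))
```

Under this assumed form the sensitivity falls from nearly 1 when E is far
below K to nearly 0 when E is far above K, in line with the weakening of the
lac catabolism mode at high induction.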

Next, we tested key findings. First, if the asymmetry in R_pμ(τ) (Fig. 2e, f)
is indeed caused by lac catabolism, this asymmetry should be suppressed
when carbon enters central metabolism via another pathway. Growth
on acetate was about as slow as on lactulose at low induction, but
R_pμ(τ) was now indeed nearly symmetric (Fig. 3a, b and Extended Data
Fig. 3). At the same time, R_Eμ(τ) became more asymmetric, as predicted
for a dominant common noise mode transmission (Fig. 3a, b and
Extended Data Fig. 3). When growing on other natural substrates including
lactose, the R_Eμ peak width scaled roughly with doubling time,
consistent with dilution setting the transmission delay timescales (Fig. 3b and
Extended Data Fig. 5). To test further whether lac fluctuations could be
causal in the growth noise, we exposed the cells to IPTG pulses in a
microfluidic device. The resulting pulses in lac expression were indeed
followed by a pulse in growth (Extended Data Fig. 6a). Next, we aimed
to mimic common noise fluctuations by growing cells on glucose minimal
medium and pulsing with amino acids. These pulses indeed produced
transient increases in μ and p (Extended Data Fig. 6b), consistent
with common noise propagating to enzyme expression and to growth.

Figure 2 | Cross-correlation functions and mathematical model.
a, Instantaneous growth rate against lac enzyme concentration from one
microcolony, corresponding to the cross-correlation value R_Eμ(0) in e. b, Model
of the coupling between expression and growth noise. Two noise sources are
specific to p and μ, one is common to p and μ. Correlations arise when
noise emitted from one source is received by two observables (p, E or μ).
Analytical solutions revealed all contributing pathways, and showed they were
finite despite the looped network structure (Supplementary Information).
c, The mean growth rate versus the mean expression level, as measured for
different levels of IPTG induction. Line: fit to a Monod growth model. d, Three
classes of noise transmission modes. As an example, a noise source (left)
emits a block wave, giving rise to signals μ, p and E (middle) and their
cross-correlations (right). Other pathways contribute as well. For instance,
common noise can also drive the catabolism mode. e-g, Cross-correlation
functions R_pμ(τ) for the enzyme production rate p(t) and growth rate μ(t) (thin
line), as well as R_Eμ(τ) for the enzyme concentration E(t) and μ(t) (thick line).
Growth is on lactulose (0.1%) with IPTG: 4 μM (e), 6 μM (f), 200 μM (g).
Top triangles indicate mean division time. Error bars denoting the standard
deviation are indicated for some data points only. The main features were
robust to changing the growth determination method and taking the cell width
into account (Extended Data Fig. 4e-h). Growth and expression differences
typically did not correlate with location within the microcolony (Extended Data
Fig. 4i). Protein production rates were determined by the time-derivative of
the total fluorescence per cell (Extended Data Fig. 1e-h). h-j, Fits to the
experimental data (e-g).

Figure 3 | Model predictions and experimental tests. Top: re-wired noise
transmission networks with predicted dominant pathways (black). Coloured
genes indicate labelling with GFP and mCherry. Middle: predicted
cross-correlation with mean doubling time (triangle). Bottom: measured
cross-correlation. Error bars denote the standard deviation. a, b, For growth on
acetate the lac enzymes are catabolically inactive. c, d, Gene with a weaker
coupling from common noise to expression (compared with the lac operon),

Second, the network structure implied a homeostatic control mechanism:
upward fluctuations in common noise increase E when transmitted
via p, but also decrease E when transmitted via μ (Fig. 2b). These
opposing effects offer a direct prediction: if the positive pathway
dominates, R_Eμ(τ) should be positive, as is the case so far. If the negative
pathway were to dominate, however, R_Eμ(τ) should become negative (Fig. 3c).
One cannot manipulate how volume changes affect dilution. To tilt the
balance, we thus looked for constructs with a weaker coupling to common
noise in the positive pathway, as measured by R_pμ(0). A constitutively
expressed mCherry with a twofold lower R_pμ(0) indeed displayed
negative R_Eμ(τ) (Fig. 3d and Extended Data Fig. 3). Thus, two parallel
antagonistic pathways that together form a so-called incoherent
feedforward network motif can partly cancel noise. This cancelling also
explains why R_Eμ(0) is low even though R_pμ(0) is high at high induction
where common noise dominates (Fig. 2g). Interestingly, while
up-fluctuations in μ are associated with up-fluctuations in E (Fig. 2g),
increases in mean μ lead to decreases in E (Extended Data Fig. 5e).
These opposing dependencies suggest that different mechanisms
underlie these two types of expression variation.
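
The sign prediction for the two antagonistic pathways can be checked with a
deliberately simple calculation. The sketch assumes that concentration obeys
production minus dilution, so that for slow fluctuations E is approximately
p/μ; this quasi-steady-state balance, the coupling strengths `t_p` and `t_mu`,
and the function names are assumptions for illustration and are not the
stochastic model of the study.

```python
def steady_state_E(p, mu):
    """Quasi-steady-state concentration when production p balances dilution mu * E."""
    return p / mu

def relative_change_in_E(noise, t_p, t_mu, p0=1.0, mu0=1.0):
    """Relative change in E when a common fluctuation `noise` scales
    p by (1 + t_p * noise) and mu by (1 + t_mu * noise)."""
    e0 = steady_state_E(p0, mu0)
    e1 = steady_state_E(p0 * (1 + t_p * noise), mu0 * (1 + t_mu * noise))
    return (e1 - e0) / e0

if __name__ == "__main__":
    # Strong coupling to production: E rises together with mu.
    print(relative_change_in_E(0.1, t_p=1.0, t_mu=0.5))
    # Weak coupling to production: dilution wins and E falls as mu rises.
    print(relative_change_in_E(0.1, t_p=0.25, t_mu=0.5))
```

For small fluctuations the change in E is approximately (t_p − t_mu) times the
common fluctuation, so weakening the positive pathway flips the sign of the
E-μ relation while equal couplings cancel.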

Third, if lac enzymes transmit to growth and growth transmits to 
expression in general, then lac enzymes ought to transmit also to other 
genes. Hence we quantified p*(t) of mCherry controlled by promoters 
with no known functional interactions with the lac system. For lactu- 
lose and low induction, mCherry fluctuations indeed occurred after lac 
fluctuations on average (Fig. 3f and Extended Data Fig. 7a, b) in ac- 
cordance with predictions (Fig. 3e). In contrast, this delay was absent 
for acetate, which is consistent because lac then does not transmit to 
growth (Extended Data Fig. 7c, d). Noise in lac expression can thus 
couple to other genes without specific regulatory interactions. 

For the lac genes, the lac catabolism mode transmitted to growth
only when the mean lac expression was kept artificially low and limited
the mean growth rate. Hence, we wondered whether limiting enzymes
in central metabolism could similarly perturb growth. For growth on
lactose, glycolysis is considered limited by pfkA, and the tricarboxylic
acid cycle by icd but not by gltA; while in acetate, gltA is limiting, icd
may be limiting but pfkA is not. We indeed observed positive time
delays in R_pμ for pfkA and icd in lactose, and for gltA and icd in acetate,
but not in the other cases (Fig. 3g and Extended Data Fig. 7e). This
pattern of correlation delays is consistent with the mechanism found for
lac, in which growth limitation in steady-state resulted in noise
transmission to growth. Notably, the differences in noise transmission
behaviour were observed for enzymes catalysing nearby reactions in the
pathway. For instance, icd acts almost directly after gltA, but icd
displayed delayed correlation in lactose while gltA did not. This excludes
the possibility that the delayed correlations are caused by synchronous
fluctuations of pfkA, gltA, icd and other central metabolic genes. Together,
the results indicate that expression-to-growth noise propagation occurs
more generally for limiting genes.

leading to dominant dilution. e, f, Transmission from the lac genes to another
gene via growth. When the lac genes do not transmit because cells grow on
acetate, the correlation is symmetric (Extended Data Fig. 7c, d). g, Time delays
for lac, pfkA, gltA and icd in lactose (not boxed) and acetate media (boxed), as
derived from the correlation functions R_pμ(τ) (Extended Data Fig. 7e). Small
square boxes indicate which gene is considered limiting in steady-state in a
particular medium (see main text).

Our study shows that fluctuations in gene expression can affect the
growth stability of a cell, and, in turn, growth noise affects gene
expression. This entanglement between growth and expression noise reflects
the inherent auto-catalytic nature of self-replicating systems: metabolic
enzymes help synthesize the building blocks for their own synthesis.
The results raise the question of how different fluctuating metabolic
activities within the cell are coordinated, and which regulatory mechanisms
are implicated in maintaining growth homeostasis. Metabolic
stochasticity could allow clonal cells in a population to adopt a wide spectrum
of metabolic states, and hence enable bet-hedging strategies to exploit
new conditions optimally. Metabolic stochasticity could represent a
generic source of cellular heterogeneity, but also prevent optimal growth
and limit efficient biosynthesis. Novel approaches are required to
incorporate noise transmission within the current theoretical framework
of metabolism.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
Extended Data Figure 1 | Image analysis and determination of cell length,
elongation rate, enzyme concentration and production rate. a, Segmented
cell silhouettes are obtained by applying a Laplacian of Gaussian filter on phase
contrast images. b, The cell axis is determined by fitting a third-degree polynomial
through the silhouette. c, Cell-length determination. We compute the distances
between points on the cell axis and the closest 25 segmentation pixels. The
sum of these distances squared, here termed the silhouette proximity, is plotted
for points along the cell axis. In the centre of the cell silhouette or mask, the
silhouette proximity consistently remains at 4.06 μm², but near the cell poles it
rapidly increases. The location of each cell pole was taken at a silhouette
proximity of 4.47 μm². d, Elongation rate of a single cell. The length of a single
cell, its parent and its offspring plotted over time (dark circles). Instantaneous
exponential elongation rate is determined by fitting an exponential to these
data for a fraction of the cell cycle. At the beginning and end of each cell cycle,
length data of the parent or the offspring are used for this fitting process (grey
circles, see Supplementary Information). e, Initial fluorescence image. f, Image
after background correction, shading correction and deconvolution by a
point spread function. Total cell fluorescence is determined as the sum of
fluorescence values within the cell silhouette. g, To determine the cellular
fluorescence intensity that reports for the enzyme concentration accurately, we
averaged the fluorescence values of pixels within a box of fixed width and
equidistant length from the poles inside the cell perimeter. h, Enzyme
production rate against time p(t) for all lineages within a microcolony, from 5 h
into the experiment and onwards. Four lineages are coloured for clarity.
Black bar, mean division time; light points, division events. i, Cell length against
time L(t) as in h. j, Histograms of observed E values for different IPTG
induction levels. Bottom panel indicates the noise intensity, defined as the
standard deviation over the mean.
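
Determining an instantaneous elongation rate by fitting an exponential to
length data over part of the cell cycle can be sketched as a sliding-window
fit of a straight line to ln(length). The window size, the function name and
the synthetic data are assumptions for illustration; the handling of cell
divisions with parent and offspring data described in the caption is not
reproduced here.

```python
import numpy as np

def instantaneous_growth_rate(t, length, window=5):
    """Slope of ln(length) versus t in a centred sliding window (units: 1/time).

    Points closer than window // 2 samples to either end are left as NaN.
    """
    t = np.asarray(t, dtype=float)
    log_len = np.log(np.asarray(length, dtype=float))
    half = window // 2
    rates = np.full(len(t), np.nan)
    for i in range(half, len(t) - half):
        sl = slice(i - half, i + half + 1)
        # Fitting a line to ln(L) is equivalent to fitting an exponential to L.
        rates[i] = np.polyfit(t[sl], log_len[sl], 1)[0]
    return rates

if __name__ == "__main__":
    t = np.arange(0, 2, 1 / 48)        # hours, 48 frames per hour
    L = 2.0 * np.exp(0.7 * t)          # hypothetical cell length in micrometres
    print(instantaneous_growth_rate(t, L)[2:6])
```

A rate obtained this way is in natural-log units per hour; dividing by ln 2
converts it to doublings per hour.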
Extended Data Figure 2 | Correlations between the growth rate of sister
cells during growth on lactulose for increasing levels of IPTG induction.
a, At 4 μM IPTG, R = 0.72, n = 171, P < 10^−[…] (t-test). b, At 6 μM IPTG,
R = 0.42, n = 382, P < 10^−[…]. c, At 200 μM IPTG, R = 0.32, n = 314, P < 10^−[…].
d, Evolution in time of the correlation coefficient between growth rate of
sisters, for 6 μM IPTG. A decreasing exponential was fitted with a decay time of
2.86 h.
Extended Data Figure 3 | Quantification of symmetry of cross-correlation
functions. For each cross-correlation (corresponding figure indicated at top),
we computed the weighted average of the time delay τ_R = Σ(R_i·τ_i)/ΣR_i,
[…] cycles. A positive (respectively negative) τ_R indicates that the
cross-correlation R has more weight at positive (respectively negative) times.
Error bars denote the standard deviation of the symmetry values determined
for four sub-[…]
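
The symmetry measure in this caption is a weighted mean of the delay, with the
correlation values as weights. A minimal sketch, assuming the correlation
values are positive over the chosen delay window and using hypothetical
Gaussian-shaped test curves, is:

```python
import numpy as np

def weighted_delay(tau, R):
    """tau_R = sum_i(R_i * tau_i) / sum_i(R_i)."""
    tau = np.asarray(tau, dtype=float)
    R = np.asarray(R, dtype=float)
    return np.sum(R * tau) / np.sum(R)

if __name__ == "__main__":
    tau = np.linspace(-3, 3, 121)                 # delays, arbitrary units
    symmetric = np.exp(-tau**2 / 0.5)             # peak centred on zero delay
    shifted = np.exp(-(tau - 0.3) ** 2 / 0.5)     # more weight at positive delays
    print(weighted_delay(tau, symmetric), weighted_delay(tau, shifted))
```

The window of delays over which the sum is taken matters for real data,
because correlation values near zero or below zero make the ratio unstable.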
Extended Data Figure 4 | Cross-correlations of control experiments and
using different methods of growth-rate determination. a, Expression of lac
in a lacI repressor knockout strain on lactose minimal medium (to be compared
with Fig. 2g). b, Expression of lac measured with a GFP fusion to LacZ shows
the same result as co-transcriptional expression of GFP on 0.1% lactulose and
6 μM of IPTG (to be compared with Fig. 2f). c, Exogenous constitutive promoter
(PN25) driving the production of GFP, inserted in the cheZ locus, on minimal
medium with lactose. d, The lac promoter driving the production of yellow
fluorescent protein (YFP), inserted in the intC locus, on minimal medium with
maltose. e, Cross-correlations for lactulose growth at low IPTG (4 μM), with
growth rate determined as follows: S(t) is the surface area of the cell silhouette
versus time (Extended Data Fig. 1a). The growth rate is the time derivative
of S(t). f, The same, for lactulose growth at high IPTG (200 μM). g,
Cross-correlations for lactulose growth at low IPTG (4 μM), with growth rate
determined as follows: S(t) is the surface area of the cell silhouette versus time,
L(t) is the length of the cell silhouette versus time (Extended Data Fig. 1b, c).
The growth rate is the derivative of L(t) × [S(t)/L(t)]². Note that S(t)/L(t) is
taken as a measure for the width of the cell, and the width squared times the
length as a measure for the cell volume. h, The same, for lactulose growth at
high IPTG (200 μM). These cross-correlations display the same shape and
symmetry as in Fig. 2e, g, where the growth rate is determined as the derivative
of the length of the cell silhouette (Extended Data Fig. 1d). Hence the
central features are robust to different methods of growth rate determination.
i, Scatter plot of instantaneous growth rate and cell position within the
microcolony. The cell position was calculated as the minimal distance of the
centre of a cell to the edge of the microcolony. Data obtained during growth
on lactulose at intermediate IPTG induction (6 μM).
Extended Data Figure 5 | Cross-correlations for growth on different carbon
sources. a, Schematic diagram of noise transmission during growth on lactose,
which is predicted to be similar to the case of growth on lactulose at high
IPTG induction (see Fig. 2g, j). b, Corresponding measured cross-correlations.
c, Theoretical cross-correlations obtained by using the parameters during
growth on lactulose and changing exclusively the population average growth
rate to the experimentally measured value. This prediction displays a positive
asymmetric peak towards negative time and a width scaling with the average
growth rate. d, Corresponding measured cross-correlations. e, Population
average lac enzyme concentration versus the population average growth rate on
minimal medium supplemented with varying carbon sources.
Extended Data Figure 6 | External media perturbations in microfluidic
device. a, Growth of AB460 in microfluidic device (see Supplementary
Information) on M9 medium with 0.1% lactulose, 0.01% Tween-20 and 16 μM
IPTG. A 2-h pulse to medium with 3 μM IPTG is indicated in red. Black line is
the mean, and grey area is the standard deviation, of approximately 60 cells.
Indicated are the lac production rate (p), lac concentration (E) and cell growth
rate (μ). The duration and intensity of the pulse were chosen to reflect the
naturally occurring fluctuations in lac expression. Upon the pulse, the
production rate transiently decreased, followed by a gradual transient decrease
in lac concentration, and a transient decrease in growth rate. These data are
consistent with the catabolism transmission mode (top). b, Growth of ASC631
in microfluidic device on M9 medium with 0.1% glucose, 0.01% Tween-20
and 1 mM IPTG. To mimic fluctuations in common components, a 1-h pulse of
amino acids (Teknova M2104) added to the medium is indicated in green.
Both growth and production rate increase immediately upon addition of amino
acids, reflecting the common noise transmission mode (top). The enzyme
concentration remained relatively stable, showing that for these perturbations
the production increase and dilution increase cancelled each other. These
data are consistent with the common noise mode (top).
Extended Data Figure 7 | Cross-correlations of additional constructs.
a, Transmission from lac to another gene via growth (on 0.1% lactulose and
6 μM IPTG) shown by the asymmetric cross-correlations between lac
production rate and mCherry production driven by the constitutive bla
promoter. b, The same for lac production rate and mCherry driven by the mel
promoter induced by 0.2% melibiose (ΔmelA strain). c, Symmetric
cross-correlation between lac production rate p and other gene production rate p*
predicted for growth on acetate (see d). d, Absence of transmission shown
by the cross-correlation between lac production rate p and the mCherry
production rate p* driven by the constitutive PN25 promoter, on minimal
medium with 0.1% acetate, consistent with predictions (c). e,
Cross-correlations (R_pμ) for lac, pfkA, gltA and icd in lactose (left) and acetate
media (right).
Extended Data Figure 8 | Extracting and weighting lineages from a branched
data set. a, Depiction of a growing microcolony over time, starting with two
cells on the left and growing into five cells on the right. b, A lineage tree of
the data shown in a. The tree starts with two lines (left), indicating the two
starting cells, and at each division the line splits, resulting in five cells at the end
(right). c, Five lineages can be extracted from the data. Note that most lineages
share part of their data. When correlating data points from t0 with t1, one
pair consists of completely independent data points (lineage I). Two lineages
provide exactly the same pairs of data points (lineages IV and V), and two
lineages only share a data point at t0 (lineages II and III). d, Different
types of weighting for the correlation of data points from t0 with t1, as used in
equation (6) in Supplementary Information. No: each lineage is weighted
equally. Unique pairs: weighting such that only comparisons between unique
data pairs are used. Unique points: lineages II and III are not completely
independent, which can be corrected for by this weighting from equation (5)
in Supplementary Information.
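
Extracting lineages from a branched data set amounts to tracing every cell
present at the end of the experiment back to its founder. The sketch below
shows only this extraction step; the weighting of equations (5) and (6) in the
Supplementary Information is not reproduced. The tree, the cell names and the
function name are hypothetical and merely mimic a colony that grows from two
cells into five.

```python
def extract_lineages(parent):
    """Return all founder-to-leaf paths.

    `parent` maps each cell id to its parent id (None for founder cells).
    """
    children = {}
    for cell, par in parent.items():
        children.setdefault(cell, [])
        if par is not None:
            children.setdefault(par, []).append(cell)
    lineages = []
    for cell, kids in children.items():
        if not kids:                      # a cell that never divided: end of a lineage
            path = []
            node = cell
            while node is not None:       # walk back to the founder
                path.append(node)
                node = parent.get(node)
            lineages.append(path[::-1])
    return lineages

# Hypothetical colony: founder A never divides, founder B divides twice.
EXAMPLE_PARENT = {
    "A": None, "B": None,
    "B1": "B", "B2": "B",
    "B11": "B1", "B12": "B1", "B21": "B2", "B22": "B2",
}

if __name__ == "__main__":
    for path in extract_lineages(EXAMPLE_PARENT):
        print(" -> ".join(path))
```

Because lineages that share an ancestor reuse the same early data points, any
statistic averaged over them needs a weighting scheme of the kind listed in
panel d.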
Extended Data Table 1 | Contribution of noise transmitted from lac
concentration E to different variables in various culture media

Lactulose experiments                   Noise observed in   Transmitted from E
Low induction (IPTG = 4 μM)             p                   12%
                                        E                   34%
                                        μ                   31%
Intermediate induction (IPTG = 6 μM)    p                   9.80%
                                        E                   19%
                                        μ                   20%
Constitutive gene (IPTG = 6 μM)         p*                  13%
                                        E*                  <1%

The contribution of noise transmitted from E was computed by comparing the coefficient of variation of
a given variable with or without transmission from E, using the values fitted with the model. Note that a
decomposition of noise as a sum of coefficients of variation is not possible here, given the feedback of E
on itself, which leads to self-sustained fluctuations which impact the noise intensity in a non-additive
way.
Extended Data Table 2 | List of strains used in this study

Strain    Genotype                                    Origin
AB460     ΔlacA::gfp-cat                              Constructed by A. Bohm
ASC631    ΔlacA::gfp-cat, Δphp::PN25-mCherry-kanR     This study
ASC636    ΔlacA::gfp-cat, ΔcheZ::PN25-mCherry-kanR    This study
ASC638    ΔcheZ::PN25-gfp-kanR                        This study
ASC639    ΔlacA::gfp-cat, ΔlacI::kanR                 This study
ASC662    lacZ-gfp                                    This study
ASC640    ΔlacA::gfp-cat, Δphp::Bla-mCherry-kanR      This study
ASC644    ΔlacA::gfp-cat, ΔmelA::mCherry-kanR         This study
ASC666    L31::mCherry-kanR, gltA::gfpA206K-cat       This study
ASC677    L34::mCherry-kanR, pfkA::gfpA206K-cat       This study
ASC678    L31::mCherry-kanR, icd::gfpA206K-cat        This study
MG22      ΔintC PL-lacO1::yfp                         Elowitz lab
NCM520    ΔlacAYZ                                     Obtained from the Coli Genetic Stock Center
CRISPR-mediated direct mutation of cancer genes in the mouse liver
The study of cancer genes in mouse models has traditionally relied
on genetically engineered strains made via transgenesis or gene
targeting in embryonic stem cells. Here we describe a new method of
cancer model generation using the CRISPR/Cas (clustered regularly
interspaced short palindromic repeats/CRISPR-associated proteins)
system in vivo in wild-type mice. We used hydrodynamic injection
to deliver a CRISPR plasmid DNA expressing Cas9 and single guide
RNAs (sgRNAs) to the liver that directly target the tumour
suppressor genes Pten (ref. 5) and p53 (also known as TP53 and Trp53)
(ref. 6), alone and in combination. CRISPR-mediated Pten mutation
led to elevated Akt phosphorylation and lipid accumulation in
hepatocytes, phenocopying the effects of deletion of the gene using
Cre-loxP technology. Simultaneous targeting of Pten and p53
induced liver tumours that mimicked those caused by
Cre-loxP-mediated deletion of Pten and p53. DNA sequencing of liver and
tumour tissue revealed insertion or deletion mutations of the tumour
suppressor genes, including bi-allelic mutations of both Pten and p53
in tumours. Furthermore, co-injection of Cas9 plasmids harbouring
sgRNAs targeting the β-catenin gene and a single-stranded DNA
oligonucleotide donor carrying activating point mutations led to
the generation of hepatocytes with nuclear localization of β-catenin.
This study demonstrates the feasibility of direct mutation of tumour
suppressor genes and oncogenes in the liver using the CRISPR/Cas
system, which presents a new avenue for rapid development of liver
cancer models and functional genomics.

The prokaryotic type II CRISPR/Cas genome editing tools have been
successfully applied in many organisms, including mouse and human
cells. The system offers sequence-specific direct editing of DNA;
therefore, unlike RNA-interference-based approaches, this method
can achieve complete loss-of-function of the encoded protein. In rodent
and primate embryonic stem cells or zygotes, CRISPR has been applied
to efficiently generate mutant alleles or reporter genes. Our groups
have previously shown that in vivo delivery of CRISPR can repair a
disease gene in mouse liver. However, generation of somatic cancer
mutations in adult animals using CRISPR has not, to our knowledge,
been reported.

To investigate the potential of the CRISPR system to directly induce
loss-of-function mutations in vivo, we chose to target the tumour suppressor
gene Pten, which is a negative regulator of the
phosphatidylinositol-3-kinase (PI3K)/Akt pathway. Mutation and genomic loss of Pten have
been identified in many types of human cancer, and liver-specific knockout
of Pten in mice induces lipid accumulation and late-onset liver
cancer. We cloned a pX330 vector co-expressing an sgRNA targeting
Pten (Pten target sequence 1 in Supplementary Table 1, termed sgPten)
and Cas9. We first showed that sgPten could induce Pten mutations
in mouse 3T3 cells following transfection (Extended Data Fig. 1 and
Supplementary Table 5). To deliver CRISPR to the liver in adult mice,
we employed hydrodynamic tail-vein injection (Fig. 1a), which can
deliver DNA to ~20% of hepatocytes for transient expression. As
shown in Fig. 1b, hydrodynamic injection of a luciferase plasmid DNA
resulted in liver-specific expression of luciferase in mice. We next
injected a cohort of FVB mice (an inbred wild-type mouse strain) with
sgPten and an equal number of mice with a pX330 plasmid encoding an
sgRNA targeting GFP (sgGFP) as a control. In parallel, we genetically
deleted Pten in the liver of Pten-floxed mice (Pten^fl/fl) via tail-vein
injection of adenovirus expressing the Cre recombinase (adeno-Cre). Two
weeks later, immunohistochemical (IHC) staining of liver sections from
five of the sgPten-treated mice using a Pten-specific antibody revealed
3.3 ± 0.5% hepatocytes with negative Pten staining, surrounded by
Pten-positive cells (Fig. 1c, d and Extended Data Fig. 2a-c). Importantly, the
liver is composed of a mixture of diploid and polyploid hepatocytes,
and we cannot determine the ploidy of the Pten-deficient cells. Thus,
they may be a mixture of cells with two or more mutated Pten alleles.
A lower percentage (0.4 ± 0.1%) of hepatocytes showed intermediate Pten
staining (Fig. 1d and Extended Data Fig. 2c), potentially indicating
heterozygous Pten mutation in diploid cells or incomplete mutation in
polyploid cells. Coincident with negative Pten staining, we detected elevated
staining of phospho-Akt (pAkt), a biomarker of PI3K pathway
activity, in sgPten-treated (n = 5) and adeno-Cre-injected Pten^fl/fl mice
(n = 5) (Fig. 1c bottom panel and Extended Data Fig. 2d). Histological
analysis and Oil Red O staining at two months showed hepatocytes with
lipid accumulation in sgPten-treated FVB mice (n = 5) and
adeno-Cre-treated Pten^fl/fl mice (Extended Data Fig. 3), which is a known
phenotype associated with Pten mutation in the liver. These data indicate
that in vivo CRISPR-mediated genome editing was able to generate
Pten-negative cells in the liver, mimicking liver-specific conditional deletion
of Pten in mice.

To confirm that loss of Pten staining and function occurred due to 
CRISPR-mediated mutation of Pten, we performed deep sequencing on 
the captured targeted region of Pten locus of total liver genomic DNA. 
Sequencing revealed that 2.6 + 1.4% of the sequencing reads had inser- 
tion or deletion mutations (indels) at the Pten locus in sgPten-treated 
mice (n = 5) compared to 0.5 + 0.1% in sgGFP-treated mice (n = 3, P= 
0.02) (Fig. le). In the sgPten-treated livers, most of the sequence variants 
were predicted to cause frameshift mutations as inferred from insertion 
length and/or phase (Fig. 1f, g and Extended Data Fig. 4). For example, 
we observed frequent occurrence of 1-nucleotide or 2-nucleotide indels, 
which would lead to disruption of the Pten reading frame (Fig. 1f-h and 
Supplementary Table 4). These indels clustered at the predicted sgPten- 
induced Cas9 cutting site (Fig. 1h and Extended Data Fig. 4a), whereas 
the indels detected in sgGFP samples distribute randomly at low fre- 
quency, probably due to background PCR errors or sequencing errors 
(Extended Data Fig. 4b). Notably, in five independent mice, the fre- 
quency of Pten loss scored by IHC (including both full and partial loss 
Figure 1 | Hydrodynamic injection of CRISPR deletes Pten in a subset of 
hepatocytes in mice. a, pX330 plasmids expressing Cas9 and sgRNA targeting 
Pten (sgPten) were hydrodynamically injected into wild-type FVB mice to 
transiently express the CRISPR components in hepatocytes. A cartoon of 
sgRNA is shown in the top right of the panel. b, Bioluminescence imaging 

of mice hydrodynamically injected with a luciferase plasmid shows liver- 
specific luciferase expression (n = 3). c, Representative haematoxylin and eosin 
(H&E) and IHC staining of FVB mice injected with sgGFP (as a control) or 
sgPten, and Pten^fl/fl mice injected with adeno-Cre for 2 weeks. Note the 
hepatocytes with clear cytoplasm on H&E sections, indicating lipid 
accumulation. Scale bars are 50 µm. The insets are high-magnification views 
(×400). d, Percentage of hepatocytes with negative or intermediate Pten 
staining. Error bars are standard deviation (s.d.) (n = 5 mice). e, Pten indel 


of signal) strongly correlated with the frequency of Pten indels (Fig. 1i, 
R² = 0.81). These data indicate that for most cells, expression of the 
sgPten vector results in complete mutation of all Pten alleles present in 
the cell. Because non-parenchymal cells in the liver generally do not take 
up DNA following hydrodynamic injection, it is not surprising that 
the indel frequency in liver genomic DNA and the frequency of Pten- 
negative hepatocytes are not strictly equal. 
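
The frame-phase classification used above (indel length modulo 3, as defined in the Fig. 1g legend) can be sketched in a few lines. This is only an illustration and not the authors' analysis script; the function names and the example indel lengths are hypothetical.

```python
from collections import Counter


def frame_phase(indel_length: int) -> str:
    """Classify an indel as 3N, 3N+1 or 3N+2 from its length in base pairs."""
    if indel_length <= 0:
        raise ValueError("indel length must be a positive number of base pairs")
    remainder = indel_length % 3
    return "3N" if remainder == 0 else f"3N+{remainder}"


def is_frameshift(indel_length: int) -> bool:
    """An indel shifts the reading frame unless its length is a multiple of 3."""
    return indel_length % 3 != 0


def phase_fractions(indel_lengths):
    """Return the fraction of indels falling in each frame-phase class."""
    counts = Counter(frame_phase(n) for n in indel_lengths)
    total = sum(counts.values())
    return {phase: counts[phase] / total for phase in ("3N", "3N+1", "3N+2")}


# Hypothetical indel lengths (bp), for illustration only.
example_lengths = [1, 1, 2, 3, 4, 7, 9, 2]
fractions = phase_fractions(example_lengths)
```

With these made-up lengths, half of the indels fall in the 3N+1 class and a quarter in each of the other two, so three quarters would be predicted to disrupt the reading frame.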

To assess the long-term phenotype following sgPten treatment, we 
harvested livers from three sgPten-treated mice at four months. As shown 
in Fig. 2a, these livers exhibited regions of hepatocytes with prominent 
lipid accumulation, loss of Pten and increased pAkt staining, which 
phenocopies Pten-knockout mice”*. To address whether sgPten induces 
p53 in hepatocytes, we performed p53 IHC on sgPten-treated liver sec- 
tions at 14 days and 4 months. sgPten liver sections did not stain posi- 
tively for p53, despite elevated pAkt (Fig. 2a and Extended Data Fig. 5), 
suggesting that Pten loss does not activate the p53 pathway in the liver 
at these time points. Given the long tumour latency of liver tumours in 
Pten-knockout mice (44-74 weeks)’, we did not observe liver tumours 
in sgPten-treated mice at time points up to 4 months. 

Recent studies identified that Cas9 can tolerate mismatches between 
sgRNA and genomic DNA depending on the sgRNA sequence and the 
position of the mismatches””’. To characterize potential off-target effects 
of sgPten in the liver, we identified top-ranking sgPten off-target geno- 
mic sites in the mouse genome (Extended Data Fig. 6a) using a pub- 
lished prediction tool’. We amplified the Pten locus and the top four 
potential off-target sites from sgGFP- and sgPten-treated livers and mea- 
sured CRISPR editing using the Surveyor assay’. In the sgPten-treated 
livers, the assay revealed 2.3 ± 0.4% (n = 2) indels at the Pten locus. In 
frequency in the total liver genomic DNA. Error bars are s.d. (n = 3 mice 

for sgGFP and n = 5 mice for sgPten). f-h, Representative Pten indels in 
sgPten-treated mice. f, Distribution of indel length. g, Distribution of indel 
frame phase calculated as the length of indels modulo 3. For example, 1-, 
4- and 7-base-pair indels are 3N+1; 2-, 5- and 8-base-pair indels are 3N+2; and 
3-, 6- and 9-base-pair indels are 3N. The pie chart shows the percentage of each 
class of indel types. h, Representative views of Pten indels in sgPten-treated 
mice using the Integrative Genomics Viewer. Black or purple bars indicate 
deletions or insertions, respectively. Arrowhead denotes predicted Cas9 cutting 
site. A full list of indels can be found in Supplementary Table 4. PAM, 
protospacer adjacent motif. i, Correlation between Pten loss determined by 
IHC and deep sequencing. Each dot is an individual mouse treated with sgPten. 


contrast, we did not detect Surveyor nuclease cutting at the assayed off- 
target sites (Extended Data Fig. 6b), indicating that the frequency of 
off-target editing is below the limit of detection of this assay. Deep se- 
quencing of an sgPten-treated liver sample revealed that the indel fre- 
quency within 10-nucleotide regions around the top three predicted 
cutting sites was <0.1% (Supplementary Table 6). 

We next tested a nickase version of Cas9, Cas9(D10A), which only 
makes single-strand DNA (ssDNA) breaks and was reported to have 
further reduced levels of off-target effects**. We designed a pair of Pten 
sgRNAs (Fig. 2b) predicted to generate off-set ssDNA breaks. The 
Cas9(D10A) plasmid and the two PCR products containing a U6 promoter 
driving expression of off-set Pten sgRNAs (termed sgPten.2/3) were introduced 
into FVB mice (n = 5) by hydrodynamic injection. U6-sgGFP 
PCR DNA served as a control (n = 5). By deep sequencing of the liver 
genomic DNA isolated from two mice at two weeks post-injection, we 
observed 2.7 ± 0.1% indels at the Pten locus in sgPten.2/3-treated mice 
compared to 0.2 ± 0.2% in sgGFP-treated mice (n = 2) (Fig. 2c, d and 
Extended Data Fig. 4c and Supplementary Table 4). Pten-negative cells 
were observed in the sgPten.2/3 livers (2.8 ± 0.4%) but not in sgGFP 
controls (0.0 ± 0.0%) by IHC staining (Fig. 2e) (n = 5). 

To test whether CRISPR-mediated mutation can target other tumour 
suppressor genes in vivo, we designed constructs to mutate p53 (also 
known as TP53 and Trp53), which is the most frequently mutated tu- 
mour suppressor gene in human cancer®. Exome-sequencing studies 
have identified frequent mutations of p53 and PTEN in human cho- 
langiocarcinoma”’. An sgRNA construct targeting p53 was cloned into 
the pX330 plasmid (termed sgp53)”*. Transfection of sgp53 into 3T3 cells 
led to frequent p53 indels, as measured by deep sequencing (Extended 
Figure 2 | Long-term effects of sgPten in the liver and off-set CRISPR 
double nickase strategy. a, IHC on serial liver sections from sgPten-treated 
mice at 4 months post-injection. Scale bars are 100 µm. n = 3 mice. The lower-left 
insets are high-magnification views (×120). The ‘p53 ON’ inset is a p53-restored 
liver tumour as a positive control”. b, FVB mice were injected with 
Cas9(D10A) plus sgGFP (as a control) or plus a pair of off-set sgRNAs targeting 
Pten (sgPten.2 and sgPten.3) to introduce double nicking. Red arrowheads 


Data Fig. 7 and Supplementary Table 5). We next injected a cohort of 
FVB mice with sgp53 alone. These mice did not exhibit liver tumours at 
three months post-injection (Extended Data Fig. 8a), which is consist- 
ent with previous studies showing that liver-specific p53-knockout mice 
denote predicted Cas9(D10A) cutting sites. Total liver genomic DNA was analysed 
by deep sequencing for Pten indels. Representative sequences are shown. 
Red lines denote deletions, the black arrow denotes an insertion. c, Frequency 
of Pten indels (n = 2 mice). Error bars are s.d. d, Frame phase of Pten indels 
calculated as the length of indels modulo 3. e, H&E and Pten IHC staining of 
liver sections. Arrows denote cells showing negative Pten staining or lipid 
accumulation. n = 5 mice. Scale bar is 50 µm. 


develop liver tumours only after 14 months”. We also performed deep 
sequencing of sgp53-treated livers at 14 days and detected 6.0 ± 0.1% 
indels at the p53 locus (Extended Data Fig. 8b and Supplementary Table 4), 
demonstrating that sgp53 can directly generate mutations in p53 in the 
mouse liver. 

In an effort to mutate two tumour suppressor genes simultaneously, 
we co-injected sgPten and sgp53 into FVB mice (Fig. 3a). As shown in 
Fig. 3b and Extended Data Fig. 9, indels were observed in total liver DNA 
isolated from two animals, at frequencies of 4.0 ± 0.1% for Pten and 
6.4 ± 0.1% for p53, enriched at the predicted cutting sites. At 3 months 
post-injection, all 5 mice co-injected with sgPten and sgp53 developed 
liver tumours with bile duct differentiation features (Fig. 3c), whereas 
none of the sgGFP-injected mice (n = 5) developed tumours. The tumours 
were positive for cytokeratin 19, a marker of biliary lineage cells” 
(Fig. 3c). Pten^fl/fl;p53^fl/fl conditional knockout mice (n = 5) injected with 
adeno-Cre also developed liver tumours of similar histology at 3 months 
(Fig. 3c). When injected alone, neither sgPten nor sgp53 caused any 
detectable tumours at this time point. Sequencing of the sgPten- plus 
sgp53-induced liver tumours and tumour-derived cell lines (n = 5 tumours 
analysed) showed bi-allelic mutations of both genes (Fig. 3d and Sup- 
plementary Table 7). These results demonstrate that CRISPR-mediated 
mutation of Pten and p53 can induce liver tumour development, sup- 
porting the use of multiplexed CRISPR editing of cancer genes, at least 
in this tissue. 


Figure 3 | Multiplexed CRISPR targeting Pten and p53 induces tumour 
formation in murine liver. a, pX330 plasmids expressing sgPten and sgp53 
were hydrodynamically injected into FVB mice. b, Frequency of Pten and p53 
indels quantified by Illumina MiSeq (n = 2 mice) at 14 days post-injection. 
Error bars are s.d. c, Representative H&E and IHC staining of FVB mice 
injected with sgGFP (as a control) or sgPten + sgp53, and Pten^fl/fl;p53^fl/fl mice 
injected with adeno-Cre. Arrows indicate cytokeratin-19-positive bile duct cells 
in sgGFP mice. Scale bars are 100 µm. n = 5 mice. d, Representative sequences 
of Pten and p53 loci in sgPten + sgp53 induced liver tumours (n = 5 tumours). 
Red lines denote deletions, black arrows denote insertions. 
Figure 4 | CRISPR introduces β-catenin mutations in the liver. a, FVB mice 
were co-injected with two sgRNAs targeting the β-catenin gene Ctnnb1 
(sgβ-catenin) and a 200-nucleotide ssDNA oligonucleotide containing four 
alanine point mutations (red) which abolish the phosphorylation of serine and 
threonine sites of β-catenin. Codons are underlined. Protospacer adjacent 
motif (PAM) sequences are marked in blue. b, Quantification of hepatocytes 
with nuclear β-catenin IHC staining at day 7. Mice were injected with indicated 
combination. sgGFP serves as a control sgRNA. n = 5 mice, *P < 0.05. Error 
bars are s.d. c, IHC on serial liver sections. Glutamine synthetase (GS) is 
normally expressed surrounding the central veins (left). Arrowheads indicate 


To determine whether CRISPR can be used to directly introduce gain- 
of-function mutations, we targeted the Ctnnb1 gene, which encodes 
β-catenin, a transcription factor in the Wnt signalling pathway that is 
frequently mutated in liver cancer’*. Phosphorylation of four serine/threonine 
residues leads to degradation of β-catenin (Fig. 4a)**. We 
co-injected FVB mice with two pX330 plasmids carrying sgRNAs targeting 
Ctnnb1 (termed sgβ-catenin) and a 200-nucleotide ssDNA oligonucleotide 
containing four point mutations that cause serine/threonine 
to be replaced with alanine (Fig. 4a), which together have been shown 
to abolish phosphorylation and cause stabilization and nuclear localization 
of β-catenin (ref. 28). In mice injected with either sgβ-catenin 
or ssDNA alone (n = 5), β-catenin was localized only at cell junctions 
as shown by IHC (Fig. 4b, c). In contrast, in five mice injected with 
sgβ-catenin and ssDNA, we observed that ~0.5% of hepatocytes exhibited 
nuclear β-catenin at 7 days post-injection (Fig. 4b, c). Moreover, accumulation 
of β-catenin was associated with increased levels of glutamine 
synthetase, a β-catenin target gene” in the liver (Fig. 4c and Extended 
Data Fig. 10a) and reduced phospho-β-catenin (Fig. 4d). In addition, 
we subjected the liver DNA from two mice treated with this combination 
to deep sequencing. The data confirm that a small but detectable 
percentage of sequencing reads contained the four ‘G’ point mutations 
present in the ssDNA (Fig. 4e and Extended Data Fig. 10b). Single-guide 
β-catenin also generated indels clustered at the predicted Cas9 cutting 
sites (Supplementary Table 8). These data demonstrate that the CRISPR 
system can be used to directly induce gain-of-function mutations or other 
substitutions via homologous recombination in vivo”. 
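
As an illustration of how a donor allele frequency of this kind might be computed, the sketch below counts reads that carry the donor base at all four edited positions. It is not the authors' pipeline: the function name, the assumption that reads are already aligned gap-free to a common amplicon window, and the toy reads and positions are all hypothetical.

```python
def donor_allele_frequency(reads, donor_positions, donor_base="G"):
    """Fraction of reads carrying the donor base at every donor position.

    Assumes each read is a string aligned without gaps to the same amplicon
    window, so that index i in every read refers to the same reference base.
    Reads too short to cover all donor positions are ignored.
    """
    informative = [r for r in reads if all(p < len(r) for p in donor_positions)]
    if not informative:
        return 0.0
    hits = sum(
        all(read[p] == donor_base for p in donor_positions)
        for read in informative
    )
    return hits / len(informative)


# Toy data, for illustration only: four hypothetical reads and four
# hypothetical donor positions (0-based) within the window.
toy_positions = (1, 4, 7, 9)
toy_reads = [
    "AGCTGAAGAG",  # donor base at all four positions
    "ATCTCAATAC",  # reference-like read
    "AGCTGAAGAC",  # only three of four positions edited, not counted
    "ATCTCAATAC",  # reference-like read
]
toy_frequency = donor_allele_frequency(toy_reads, toy_positions)
```

Requiring all four substitutions in the same read, rather than counting each position separately, distinguishes integration of the donor template from isolated sequencing errors at single positions.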

Our data illustrate the potential to directly disrupt tumour suppres- 
sor genes and generate point mutations in oncogenes in adult mouse 
liver using the CRISPR/Cas system. This method bypasses the need to 
engineer embryonic stem cells and to breed multiple mutant animals 
together to generate compound mutants. This approach generated com- 
pound Pten and p53 indels at low frequency but was sufficient to induce 
multifocal tumours. We anticipate that this method will allow for more 
rapid testing of any single genes or gene combinations that are suspected 
overlap of β-catenin and GS staining outside the central vein (CV) region 
(right). Scale bars are 100 µm. The insets are high-magnification views (×400). 
d, Single confocal sections show nuclear β-catenin and loss of cytoplasmic 
phospho-β-catenin in the cell indicated by an arrow. Scale bars are 100 µm. The 
insets are ×400 magnification views. e, Representative Ctnnb1 deep sequencing 
reads in sgβ-catenin + ssDNA-treated mice. Each grey bar represents a 
sequencing read. Ctnnb1 reference sequence is shown at the bottom. Reads 
matching the reference sequence are in grey. Variant bases are in colours 
(G in orange). n = 2 mice. 


of being capable of initiating tumour formation in the liver. Given the 
number of candidate cancer genes being discovered through next gen- 
eration sequencing efforts, simplified methods of testing the oncogenic 
properties of candidates in vivo are critical. To increase the sensitivity 
of the assay, one could perform the CRISPR/Cas9-mediated mutagen- 
esis on a ‘sensitized’ background carrying constitutive or conditional 
mutations in a tumour suppressor gene such as p53. More efficient 
delivery techniques, such as adenovirus or adeno-associated virus (F.A. 
Ran et al., submitted), more potent sgRNAs, and longer homologous 
recombination templates might also improve the overall efficiency of 
this method and expand the range of tissue that could be targeted. Con- 
sistent with recent studies showing that long-term Cas9/sgRNA ex- 
pression is not toxic in cells”*®, hydrodynamic injection of Cas9/sgGFP 
in mice was well tolerated and did not trigger weight loss in mice”. How- 
ever, further studies are required to fully evaluate the side effects of the 
CRISPR system in mice and other organisms. This study underscores 
the power of the CRISPR/Cas9 system for rapid genome editing and 
the development of novel cancer models in the mouse. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

CRISPR vectors. pX330 vector’ was digested with BbsI and ligated with annealed 
oligonucleotides (Supplementary Table 1). An extra G was added for sgRNAs lacking 
a 5′ G for U6 transcriptional initiation. Cas9(D10A) nickase vector was from Addgene’. 
sgPten.2 and sgPten.3 were PCR amplified from empty pX330 plasmid using the U6 
forward primer and sgRNA reverse ultramer oligonucleotides’ (Supplementary 
Tables 2 and 3) and PCR purified. 

Mice and hydrodynamic injection. All animal study protocols were approved by 
the MIT Animal Care and Use Committee. Cohorts of Pten^fl/fl and Pten^fl/fl;p53^fl/fl 
mice were infected with 1 X 10° (Fig. 1) or 1 X 10° (Fig. 3) plaque-forming units 
(PFU) of adeno-Cre (University of Iowa) in 100 µl PBS by intravenous injection. 
Vectors for hydrodynamic tail-vein injection were prepared using the EndoFree- 
Maxi Kit (Qiagen). For hydrodynamic liver injection, plasmid DNA suspended in 
2 ml saline was injected via the tail vein in 5-7 s into 8-week-old female FVB/NJ mice 
(Jackson lab). No randomization or blinding was used. The amount of injected DNA 
was 60 µg for sgPten, 60 µg each for sgPten + sgp53, 40 µg Cas9(D10A) + 2 µg sgPten.2 
PCR + 2 µg sgPten.3 PCR for off-set sgRNA study, and 30 µg sgβ-catenin.1, 30 µg 
sgβ-catenin.2 and 60 µg ssDNA for the β-catenin experiment. An equal amount of 
sgGFP was used as a control for each experiment. The Pten^fl/fl;p53^fl/fl and sgPten + 
sgp53 mice were dosed with CCl4 as in ref. 31. 

Immunohistochemistry and immunofluorescence. Mice were killed by carbon 
dioxide asphyxiation. Livers were fixed in 4% formalin overnight, embedded in 
paraffin, sectioned at 4 µm and stained with haematoxylin and eosin (H&E) for 
pathology. Liver sections were de-waxed, rehydrated and stained using standard 
immunohistochemistry protocols™. The following antibodies were used: anti-Pten (Cell 
Signaling, 9559, 1:100), anti-pAkt S473 (Cell Signaling, 4060, 1:50), β-catenin (BD, 
610154, 1:100), anti-p53 (CMS, 1:300), anti-glutamine synthetase (BD 610517, 1:200) 
and anti-cytokeratin 19 (Abcam, ab133496, 1:100). The number of hepatocytes was 
quantified from >3 low-magnification fields per mouse with 5 mice per group. 
Immunofluorescence was performed as previously described. β-catenin (BD, 610154) 
and phospho-β-catenin (Abcam, ab53050) antibodies were used. Slides were 
counterstained with 4′,6-diamidino-2-phenylindole (DAPI). Images were obtained with 
a Nikon A1R laser scanning confocal microscope using a ×40 APO Fluor objective 
(NA 0.65). 

Bioluminescence imaging. 5 µg of luciferase plasmid was hydrodynamically injected 
into FVB mice. 24 h later, bioluminescence imaging (Xenogen) was performed as 
previously described”. 


Genomic DNA purification and Surveyor assay. Genomic DNA from liver or 
tumour tissue was purified using the High Pure PCR Template Preparation Kit (Roche 
11796828001). 3T3 cells were transiently transfected with pX330 and a GFP plasmid. 
The top 20% of GFP+ cells were sorted by FACS and genomic DNA was purified at 
72 h post-transfection. Off-target sites were predicted using http://crispr.mit.edu/. 
For the Surveyor assay, PCR products (Supplementary Table 3) were gel purified 
and treated with the Surveyor nuclease kit (Transgenomic). DNA was separated on 
4-20% Novex TBE Gels (Life Technologies) and stained with ethidium bromide. 
Quantification of Surveyor bands was as in ref. 9. For sequencing of liver tumours 
and matched tumour-derived cell lines, PCR products of the Pten and p53 genomic 
regions were cloned using Zero Blunt TOPO PCR Cloning Kit (Life Technologies) 
and analysed by Sanger sequencing. One tumour from each animal was analysed. 
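
The Methods defer Surveyor band quantification to ref. 9 and do not restate the formula. A widely used estimate converts the cleaved fraction of band intensity into an indel percentage as 100 × (1 − √(1 − f_cut)), where f_cut = (b + c)/(a + b + c), a is the uncut band and b and c are the cleavage products. Whether ref. 9 uses exactly this expression is an assumption here; the function name and example intensities below are hypothetical.

```python
import math


def surveyor_indel_percent(uncut: float, cut_a: float, cut_b: float) -> float:
    """Estimate indel percentage from Surveyor band intensities.

    uncut        -- integrated intensity of the uncleaved PCR product
    cut_a, cut_b -- integrated intensities of the two cleavage products
    """
    total = uncut + cut_a + cut_b
    if total <= 0:
        raise ValueError("band intensities must sum to a positive value")
    fraction_cut = (cut_a + cut_b) / total
    # The square root accounts for cleavage requiring a heteroduplex,
    # i.e. one edited and one unedited strand annealed together.
    return 100.0 * (1.0 - math.sqrt(1.0 - fraction_cut))


# Hypothetical intensities in arbitrary units, for illustration only.
example_percent = surveyor_indel_percent(uncut=75.0, cut_a=15.0, cut_b=10.0)
```

With a quarter of the signal in the cleaved bands, the estimate is roughly 13%, lower than the raw cleaved fraction because not every edited strand ends up in a cleavable heteroduplex.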

Deep sequencing of CRISPR-modified Pten and p53 loci. The genomic region 
of Pten and p53 was PCR amplified using Herculase II high-fidelity polymerase 
and gel purified. Libraries were made from 50 ng of the PCR products using the 
Nextera protocol and sequenced on Illumina MiSeq (150 base pair (bp) paired-end) 
and HiSeq2500 (100 bp paired-end, β-catenin samples) machines. Data were 
processed according to standard Illumina sequencing analysis procedures. Briefly, 
reads were mapped to the PCR amplicons as reference sequences using 
Burrows-Wheeler Aligner with custom scripts**. Insertions and deletions were crosschecked 
against reference using VarScan2. Indel phase was calculated as the length of 
insertions or deletions modulo 3. The rate of β-catenin donor integration was 
calculated as donor allele frequency. Indels at Pten G304 were present at the same 
frequency across all samples, and thus possibly arose from PCR or sequencing errors; 
they were filtered out of the final analysis. Two to five biological replicates were sequenced for 
in vivo liver samples. One DNA sample was sequenced for Pten and p53 in 3T3 
cells and for Pten off-target sites in sgPten-treated liver. To compare the editing 
efficiency at Pten off-target sites, the indel frequency within 10-nucleotide regions 
(20 nucleotides total) flanking Pten A124, OT1 G209, OT2 G451, OT3 G265 was 
calculated. 
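
The windowed comparison described above, counting indels within 10 nucleotides on either side of a reference position, might look like the following sketch. The read representation, the function name and the toy data are assumptions made for illustration and do not reproduce the authors' custom scripts.

```python
def windowed_indel_frequency(read_indels, cut_site, flank=10):
    """Fraction of reads with at least one indel inside the window.

    read_indels -- one entry per read; each entry is a list of (start, end)
                   indel intervals in reference coordinates (0-based, end
                   exclusive). An insertion at position p is given as (p, p).
    cut_site    -- reference position at the centre of the window
    flank       -- nucleotides on each side (10 gives a 20-nucleotide window)
    """
    if not read_indels:
        return 0.0
    lo, hi = cut_site - flank, cut_site + flank
    edited = sum(
        any(start < hi and end > lo for start, end in indels)
        for indels in read_indels
    )
    return edited / len(read_indels)


# Toy reads around position 124, for illustration only.
toy_read_indels = [
    [],            # unedited read
    [(122, 125)],  # 3-bp deletion spanning the window centre
    [(300, 302)],  # deletion far from the site, not counted
    [(124, 124)],  # insertion at the site
    [],            # unedited read
]
toy_window_frequency = windowed_indel_frequency(toy_read_indels, cut_site=124)
```

Restricting the count to a narrow window is what lets background indels elsewhere in the amplicon, which are more likely to be PCR or sequencing errors, be excluded from the comparison between sites.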

Statistics. P values were determined by Student’s t-tests and ANOVA for all IHC 
quantifications. 


31. Zender, L. et al. Identification and validation of oncogenes in liver cancer using an 
integrative oncogenomic approach. Cell 125, 1253-1267 (2006). 

32. Xue, W. et al. Response and resistance to NF-κB inhibitors in mouse models of 
lung adenocarcinoma. Cancer Discov. 1, 236-247 (2011). 

33. Chen, S. et al. Global microRNA depletion suppresses tumor angiogenesis. Genes 
Dev. 28, 1054-1067 (2014). 
Extended Data Figure 1 | Representative Pten indels in sgPten-treated 3T3 
cells. Mouse 3T3 cells were co-transfected with sgPten and a GFP plasmid. The 
highest 20% of GFP-positive cells were sorted to enrich for cells expressing 
sgPten. Deep sequencing of the Pten locus revealed 36.4% Pten indels in this 
context (Supplementary Table 5), presumably due to the more efficient delivery 
of sgPten via cell culture transfection and sorting. Red arrowheads denote 
predicted Cas9 cutting sites. Black or purple bars in grey sequencing reads 
indicate deletions or insertions, respectively. Other colours indicate 
SNPs. a, Pten PCR region. b, Zoom in view. n = 1 DNA sample. c, Distribution 
of Pten indel length. 


©2014 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 2 | CRISPR generates Pten-negative hepatocytes 
in vivo. a, b, Low-magnification images of Pten IHC in sgGFP- (a) and 
sgPten-treated (b) mice. Scale bar is 100 µm. c, d, IHC on serial sections from 
sgPten-treated mice. Black arrows denote cells with negative Pten staining and 
positive pAkt staining. White arrowhead denotes cells with intermediate 
Pten staining, potentially indicating heterozygous Pten mutation or 
multinucleated hepatocytes with partial Pten loss. Insets show high-magnification 
IHC images. Scale bar is 100 µm. n = 5 mice. The frequency of Pten-deficient 
cells is probably a reflection of the transduction efficiency following 
hydrodynamic injection and the time required to achieve mutation. A recent 
study by our groups has shown that ~17% of hepatocytes were Flag-Cas9 
positive as indicated by IHC 24 h after hydrodynamic injection, only 1.4% 
of cells on day 7, and less than 0.3% at one month”. Given that Cas9-mediated 
genome editing usually takes more than 48 h (ref. 2), the fraction of hepatocytes 
that productively express Cas9 and an sgRNA after hydrodynamic injection 
is estimated to be less than 17%. 
Extended Data Figure 3 | sgPten induces lipid accumulation in the liver. 
FVB mice were injected with sgGFP or sgPten (n = 5). 2 months later, liver 
sections were stained for Oil Red O, a marker for lipid accumulation. Scale bars 
are 50 µm. 
samples show indels peaking at the predicted Cas9 cutting site whereas sgGFP 
indels distribute randomly. c, Representative indel frequency in 
Cas9(D10A) + sgPten.2/3-treated mice. For all panels */+x denotes insertion 
of x nucleotides, */-x denotes deletion of x nucleotides. 
Extended Data Figure 5 | Pten deletion in the liver does not induce p53. 
Liver sections from sgGFP- or sgPten-treated mice at 2 weeks were stained for 
p53 IHC. n = 3 mice. Scale bars are 50 µm. Positive control from a p53-restoration 
tumour is shown in the inset of Fig. 2a (p53 ON)”. 




a
No. | Sequence (20 nt + PAM)  | Score | Mismatches       | UCSC gene | Locus
1   | AGATGGTGACCAGAAACAAACAG | 2.3   | 3MMs [5:8:10]    |           | chr?:-56172579
2   | AAATCATCAGCAGAAACAAACAG |       | 3MMs [2:6:8]     |           | chr5:-81530627
3   | AGAGTGTTAGCAGAAACAATTGG | 1.4   | 3MMs [4:5:20]    |           | chr 422233525
4   | GCATTGTTACCAGAAACAAACAG | 1.3   | 4MMs [1:2:5:10]  | NR_045386 | chrig:+49962998
5   | CAATGGTTAACAGAAACAAAAAG | 1.3   | 4MMs [1:2:5:10]  |           | chr3:+51519545
6   | AGATTGTTAAAAGAAACAAATAG | 1.3   | 3MMs [5:10:11]   |           | chrt428342977
7   | AAATGGTCACCAGAAACAAAAAG | 1.3   | 4MMs [2:5:8:10]  |           | chri6:+974aa451
8   | ATATGGTTAGCAGAAAAAAAAAG | 1.3   | 3MMs [2:5:17]    |           | chr2:-118247411
9   | AGAAAGTTAGCAGAAACATATGG |       | 3MMs [4:5:19]    |           | chrt+614g4552
10  | AGATTGGTGGCAGAAACAAACAG |       | 3MMs [5:7:9]     |           | chr3:+80611324
11  | AGAGCACTAGCAGAAACAAAGGG |       | 3MMs [4:6:7]     |           | chr2:-114897270
12  | AGATTGTTATCACAAACAAATGG |       | 3MMs [5:10:13]   |           | chré:-74006099
13  | AAATCATTAGAAGAAACAAAGAG |       | 3MMs [2:6:11]    |           | chra:t71593592
14  | TAATTGTTTGCAGAAACAAAAGG |       | 4MMs [1:2:5:9]   |           | chrto:-20775651
15  | AGACAGATAACAGAAACAAATAG |       | 4MMs [4:5:7:10]  |           | chr2:-67842263
16  | GAATCTTGAGCAGAAACAAATGG | 0.8   | 4MMs [1:2:6:8]   |           | chr3:-19546766
17  | AAATTATCAGCAGAAACAAATGG | 0.8   | 4MMs [2:5:6:8]   |           | chr6:-31440319
18  | AACTCCTAAGCAGAAACAAAAGG | 0.8   | 4MMs [2:3:6:8]   |           | chr3:+93150452
19  | ATAATTTTAGCAGAAACAAATGG | 0.8   | 4MMs [2:4:5:6]   |           | chrX:+5661815
20  | TGATCATAAACAGAAACAAATAG | 0.8   | 4MMs [1:6:8:10]  |           | chr9:+82430572
21  | AAAAGGTTAGGAGAAACAAAAAG | 0.8   | 4MMs [2:4:5:11]  |           | chr5:-84250058
22  | AGATGGCTAGCAGAAAAAAAAGG | 0.8   | 3MMs [5:7:17]    |           | chr:4122331741
23  | AGGAAATTAGCAGAAACAAAAAG | 0.8   | 4MMs [3:4:5:6]   |           | chr16:+76862019
24  | AGACTGTTGACAGAAACAAAAAG | 0.8   | 4MMs [4:5:9:10]  |           | chr9:-4831483
25  | AGGTGTTTATCAGAAACAAAAGG | 0.8   | 4MMs [3:5:6:10]  |           | chrig-21529555
26  | AGAATTTTAACAGAAACAAACAG | 0.8   | 4MMs [4:5:6:10]  |           | chr7:+82812663
27  | AGAAATTTACCAGAAACAAAGAG | 0.8   | 4MMs [4:5:6:10]  |           | chrX:-143681805
28  | AGATCGTAGGCAGAAACAAGGAG | 0.8   | 3MMs [8:9:20]    |           | chr7:+86085464
29  | AGATACTGATCAGAAACAAATAG | 0.7   | 4MMs [5:6:8:10]  |           | chrig-79273102
30  | TGACTGTTAGCAGAAACAATAGG | 0.7   | 4MMs [1:4:5:20]  |           | chr2:+57558222


b Off-target sites

Amplicon:    Pten     1        3        2        4
sgPten:      - +      - +      - +      - +      - +
Uncut (bp):  345      679      542      553      429
Cut (bp):    221+124  208+471  278+264  450+103  242+187


Extended Data Figure 6 | Assessing off-target cutting of sgPten. a, Top 30 
potential off-target sites for sgPten in the mouse genome. Score is likelihood 
of off-target binding. Only site 4 is in the exon region of NR_045386, which is 
a long non-coding RNA. b, Surveyor assay in sgGFP (-) and sgPten (+) 
treated liver genomic DNA. Pten and Pten off-targets sites 1, 2, 3 and 4 were 
PCR amplified. Predicted size of uncut and cut bands are indicated. Red 
arrowheads indicate Surveyor-nuclease-cleaved Pten PCR products. The data 
are representative of two independent liver samples. 
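
The mismatch annotations in panel a (for example, 3MMs [5:8:10]) list the 1-based positions at which a candidate site differs from the 20-nucleotide sgPten target. The sketch below shows how such an annotation can be derived. The on-target sequence used here is not stated in this section; it is inferred from the mismatch annotations in the table and should be treated as an assumption, as should the function name.

```python
def mismatch_positions(on_target: str, candidate: str) -> list:
    """Return 1-based positions where two equal-length protospacers differ."""
    if len(on_target) != len(candidate):
        raise ValueError("sequences must have the same length")
    return [
        position
        for position, (a, b) in enumerate(zip(on_target, candidate), start=1)
        if a != b
    ]


# Assumed sgPten protospacer (20 nt), inferred from the table, not quoted
# from the Methods.
ON_TARGET = "AGATCGTTAGCAGAAACAAA"

# Candidate sites 1 and 28 from panel a, trimmed to the 20 nt before the PAM.
site_1 = "AGATGGTGACCAGAAACAAACAG"[:20]
site_28 = "AGATCGTAGGCAGAAACAAGGAG"[:20]

site_1_mismatches = mismatch_positions(ON_TARGET, site_1)
site_28_mismatches = mismatch_positions(ON_TARGET, site_28)
```

Under that assumption, the computed positions for sites 1 and 28 agree with the annotations printed in the table ([5:8:10] and [8:9:20]).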
Extended Data Figure 7 | Representative p53 indels in sgp53 treated 3T3 
cells. Red arrowheads denote predicted Cas9 cutting sites. Black or purple bars 
in grey sequencing reads indicate deletions or insertions, respectively. a, p53 
PCR region. b, Zoom in view. n = 1 DNA sample. 
Extended Data Figure 8 | Analysing sgp53-treated livers. a, Histology of sgp53-treated livers. Scale bars, 50 µm, n = 3 mice. b, p53 indel frequency was measured 
by MiSeq at day 14. Error bars are s.d., n = 2 mice. 
Extended Data Figure 9 | sgPten- and sgp53-generated indels in the liver. 
sgPten and sgp53 were co-injected into FVB mice. Representative analysis of 
MiSeq is shown. n = 2 mice. a-c, Pten locus. d-f, p53 locus. a, d, Indel 
frequency. */+ indicates insertions and */- indicates deletions. Base pair 
position denotes position along the Pten or p53 reference sequences. 
Arrowheads denote predicted Cas9 cutting sites. b, e, Distribution of indel 
length. c, f, Distribution of indel frame phase. Frame phase of indels was 
calculated as the length of indels modulo 3. 
Extended Data Figure 10 | CRISPR introduces β-catenin mutations in the 
liver. a, Low-magnification images of glutamine synthetase (GS) IHC as in 
Fig. 4c. b, Frequency of Ctnnb1 deep sequencing reads with all four G 
nucleotides. The rate of β-catenin donor integration was calculated as donor 
allele frequency. n = 2 mice. 
Rb suppresses human cone-precursor-derived retinoblastoma tumours 


Xiaoliang L. Xu’, Hardeep P. Singh**, Lu Wang', Dong-Lai Qi*"*, Bradford K. Poulos®, David H. Abramson*, Suresh C. Jhanwar"’ 


& David Cobrinik?*+*? 


Retinoblastoma is a childhood retinal tumour that initiates in response 
to biallelic RB1 inactivation and loss of functional retinoblastoma 
(Rb) protein. Although Rb has diverse tumour-suppressor functions 
and is inactivated in many cancers’ °, germline RB1 mutations predis- 
pose to retinoblastoma far more strongly than to other malignancies’. 
This tropism suggests that retinal cell-type-specific circuitry sensitizes 
to Rb loss, yet the nature of the circuitry and the cell type in which it 
operates have been unclear’®. Here we show that post-mitotic human 
cone precursors are uniquely sensitive to Rb depletion. Rb knock- 
down induced cone precursor proliferation in prospectively isolated 
populations and in intact retina. Proliferation followed the induction 
of E2F-regulated genes, and depended on factors having strong expres- 
sion in maturing cone precursors and crucial roles in retinoblastoma 
cell proliferation, including MYCN and MDM2. Proliferation of Rb- 
depleted cones and retinoblastoma cells also depended on the Rb- 
related protein p107, SKP2, and a p27 downregulation associated with 
cone precursor maturation. Moreover, Rb-depleted cone precursors 
formed tumours in orthotopic xenografts with histological features 
and protein expression typical of human retinoblastoma. These find- 
ings provide a compelling molecular rationale for a cone precursor 
origin of retinoblastoma. More generally, they demonstrate that cell- 
type-specific circuitry can collaborate with an initiating oncogenic 
mutation to enable tumorigenesis. 

RB1-mutant retinoblastomas can originate from a cellular state found 
during retinal development in humans but not in other species”. Accord- 
ingly, to identify the cellular state and corresponding circuitry that sen- 
sitizes to RB1 inactivation, we examined the effects of Rb depletion on 
human fetal retinal cells. Samples were from post-fertilization weeks 17- 
19, when all retinal cell types and a range of maturation states are present. 

Dissociated retinal cells were transduced with RB1-directed or con- 
trol short hairpin RNAs (shRNAs), followed by co-staining for the 
proliferation-associated Ki67 and cell-type-specific markers. RB1 shRNAs 
abrogated Rb expression in long or medium wavelength (L/M)-opsin+ 
and thyroid hormone receptor β2+ (TRβ2+) cone precursors as well 
as in other cell types (Extended Data Fig. 1a). After 2 weeks, Ki67 was 
detected in cone-precursor-like cells co-expressing the photoreceptor 
marker CRX and the cone markers L/M-opsin, cone arrestin and RXRγ 
(Fig. 1a and Extended Data Fig. 1b-h). Ki67+ cone-marker+ cells were 
first detected 9 days after transduction, whereas clusters were routinely 
detected by day 23. Ki67 was not detected in cells expressing markers 
of rods (NRL), bipolar cells (strong CHX10), ganglion cells (BRN-3), or 
amacrine or horizontal cells (PROX1+ or PAX6+, nestin−) (Fig. 1a and 
Extended Data Fig. 1i, j). Ki67 was detected in cells expressing markers 
of retinal progenitor cells (RPCs) or Müller glia (nestin or CRALBP, SOX2), 
yet in similar proportions after RB1 shRNA or control shRNA (Fig. 1a 
and Extended Data Fig. 1j). RB1 shRNAs also induced incorporation of 


5-ethynyl-2’-deoxyuridine (EdU), an indicator of S phase entry, increased 
expression of the mitosis marker phosphohistone H3, suppressed expres- 
sion of the apoptosis marker cleaved caspase 3 (CC3), and induced prolif- 
eration in cells expressing cone but not other retinal cell markers (Fig. 1c, d 
and Extended Data Fig. 1k—n). By contrast, RBJ shRNAs induced CC3 
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Figure 1 | Proliferation of cone-like cells after Rb depletion in dissociated 
FW19 retina. a, b, Responses to RB1-733 shRNA (shRB1). D, day; SCR, 
scrambled control. a, Percentage Ki67~ among cells expressing the indicated 
retinal cell-type-specific markers. b, Prevalence of DAPI" (4',6-diamidino-2- 
phenylindole”) cells expressing cone (L/M-opsin, CRX), RPC (nestin), or 
Miiller glia (CRALBP) markers. c, d, Responses to RB1-733 and RB1-737 
shRNAs (shRB1-733 and shRB1-737, respectively). c, Proliferation of cells, of 
which >90% were cone marker” at day 60. d, Percentage CC3* among cells 
expressing the indicated markers at day 14. Values and error bars denote mean 
and s.d. of triplicate assays; P values are from unpaired Student’s f-test. All 
results replicated at least twice. 
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and decreased the number of cells expressing markers of RPCs and glia 
(Fig. 1b, d and Extended Data Fig. 1n). 

To assess whether the Rb-deficient proliferating cone-like cells derived
from post-mitotic cone precursors, we examined the effects of Rb knockdown
in prospectively isolated retinal cell populations. Populations were
isolated by sorting for size, for CD133, which is expressed strongly in
maturing photoreceptors and weakly in RPCs, and for a CD44 epitope
expressed by Müller glia and RPCs (Fig. 2a). Staining for
cell-type-specific markers revealed populations enriched for cone
precursors, for rod plus cone precursors, for RPCs plus glia, and for a
mixture of rod, ganglion, bipolar, amacrine and horizontal cells (Fig. 2b
and Extended Data Fig. 2a-g). In medium and large CD133hi, CD44−
populations, 96-98% of cells co-stained for CRX and cone arrestin, which
is cone-specific at post-fertilization week 19 (FW19) (Extended Data
Fig. 2h). A similar enrichment was observed when cone precursors were
identified using CRX and RXRγ (Extended Data Fig. 2h-k).

RB1 shRNAs induced similar RB1 knockdown in each retinal cell population
(Extended Data Fig. 3a). After 2 weeks, Ki67 was detected in 80%
of cells in the cone-enriched population (Fig. 2c), probably reflecting a
high ratio of shRNA-expressing lentivirus to target cells and cone
precursor proliferation. After 3 weeks, cone precursor numbers increased
(Fig. 2c). Rb depletion did not induce proliferation in RPCs and glia,
but increased the proportion of CC3+ cells entering apoptosis (Fig. 2c).
Sorted populations transduced with the scrambled control had higher
CC3+ rates than unsorted cultures, potentially reflecting separation of
RPCs and glia from neurons. Nevertheless, Rb knockdown induced
proliferation and apoptosis in cells with the same immunophenotypes
as in unsorted cultures. Notably, Rb depletion induced the
cell-cycle-related genes CCNE1, SKP2, E2F1, RBL1, CCNB1 and CDK1 in cone
precursors, and induced p53-responsive genes in sorted RPCs and glia
(Extended Data Fig. 3). Cell-cycle-related genes were induced several
days before Ki67, suggesting that further reprogramming was needed
for cell cycle entry.

Figure 2 | Cone precursor response to Rb depletion. a, Dissociated FW18
retinal cells sorted by size, CD133 and CD44, with major populations
designated. FSC, forward scatter; misc, miscellaneous; SSC, side scatter.
b, Percentage of cone arrestin+ (cArr+), CRX+ cones, NRL+ rods, nestin+
RPCs and glia, and PAX6+, nestin− horizontal, amacrine or ganglion cells in
each population. Lg, large; med, medium; sm, small. c, Responses to RB1-733
shRNA. Percentage of Ki67+ or CC3+ cells at 14 days (top), and cell numbers
at days 14 and 23 (bottom). *P < 0.05, **P < 0.01 (compared to scrambled
control). Results in a-c are representative of three independent experiments.
d, Ki67+, L/M-opsin+ cone precursors (white arrows) in FW19 fovea
15 days after transduction with RB1-733 and RB1-737 shRNAs; and Ki67+,
L/M-opsin− cells probably representing RPCs or glia (yellow arrows) after
transduction with RB1 shRNA or scrambled control. Scale bar, 20 µm. Values
and error bars denote mean and s.d. of triplicate assays; P values are from
unpaired Student’s t-test.

RB1 shRNAs also induced cone precursor proliferation in intact retinas.
Ki67 was detected in L/M-opsin+ cone precursors in the fovea, demarcated
by cones but not rods, 15 days after transduction (Fig. 2d). Ki67
was not detected in cells expressing rod, amacrine, horizontal or ganglion
cell markers (Extended Data Fig. 4a, d). Ki67 was detected in RPCs
and glia marked by PAX6+, nestin+ or by CHX10+, CRX−, yet in similar
proportions after transduction with RB1-directed and control shRNAs
(Extended Data Fig. 4b-d). Moreover, a yellow fluorescent protein
(YFP)-expressing RB1 shRNA vector selectively induced Ki67 in YFP+ cones,
although all cell types were transduced (Extended Data Fig. 4e-h).

We next determined whether Rb-depleted cone precursors and reti- 
noblastoma cells depend on similar signalling circuitry. Retinoblastoma 
cell proliferation requires several proteins that are prominent in cone 
precursors, including TRβ2, RXRγ, MYCN and MDM2 (ref. 7). Depletion
of these factors suppressed Ki67 expression and cone precursor
proliferation both in dissociated retinal cultures (Extended Data Fig. 5a, b) 
and in isolated populations (Fig. 3a). Retinoblastoma cell proliferation 
also requires SKP2-mediated degradation of Thr 187-phosphorylated 
p27 (ref. 15). Concordantly, SKP2 depletion suppressed cone precur- 
sor proliferation and increased CC3 (Fig. 3a and Extended Data Fig. 5a). 
Notably, maturing cone precursors had exceptionally high Thr 187- 
phosphorylated p27 (Extended Data Fig. 5c), coincident with a maturation- 
associated decrease in total p27 (ref. 16), suggesting that SKP2-mediated 
p27 degradation might enable cone precursor proliferation. Consistent 
with this view, cone precursor proliferation was suppressed by ectopic 
p27 and enhanced by ectopic SKP2 or p27 knockdown (Fig. 3b and Extended
Data Fig. 5b), as in retinoblastoma cells. Thus, Rb-depleted cone
precursors and retinoblastoma cells had similar signalling requirements. 

We also assessed the roles of the Rb-related p107 and p130 proteins
(also known as RBL1 and RBL2, respectively). In mouse models, retinal 
tumorigenesis required loss of Rb combined with loss of p107, p130 or 
p27 (refs 10, 17). However, in human retinoblastomas, p130 losses are
common, whereas p107 losses are rare (Extended Data Fig. 6a). Moreover,
whereas maturing cone
precursors had abundant p130 and minimal p107, retinoblastomas had 
barely detectable p130 yet prominent p107 (Fig. 4b and Extended Data 
Fig. 6b), implicating p130 but not p107 in retinoblastoma suppression. 
Concordantly, co-knockdown of p130 with Rb increased cone precursor 
proliferation (Fig. 3a and Extended Data Fig. 5a, d) and p130 overex- 
pression suppressed cone precursor and retinoblastoma cell prolifera- 
tion (Fig. 3b, c, e). Meanwhile, p107 knockdown suppressed proliferation 
both in Rb-depleted cone precursors (Fig. 3a and Extended Data Fig. 5a, d) 
and in retinoblastoma cells (Fig. 3d, e and Extended Data Fig. 5e, g). In 
Y79 cells, p107 knockdown decreased expression of MYCN and SKP2, 
while it increased the SKP2 target, p27 (Fig. 3e). These effects were seen 
with two shRNAs and were rescued by p107 restoration (Fig. 3d, e and 
Extended Data Fig. 5d-i). Furthermore, p107 overexpression enhanced 
proliferation of retinoblastoma cells while suppressing that of neuro- 
blastoma cells (Extended Data Fig. 5h-j). Thus, both in Rb-depleted 
cone precursors and in retinoblastoma cells, p130 suppressed prolif- 
eration whereas p107 had a proliferative role distinct from its function 
in mouse models.

Figure 3 | Effects of cone precursor circuitry on response to Rb depletion.
a, Prevalence of Ki67+ or CC3+ cells (top) and cell numbers (bottom) after
shRNA transduction of isolated cone precursors. b, Isolated cone precursor
response to co-transduction with RB1 shRNA and BE-Neo (BN) vector,
BN-p130 or BN-SKP2. c-e, Effect of p130 overexpression or p107 knockdown
on Y79 cell proliferation (c, d) and protein expression (e). *P < 0.05,
**P < 0.01 (compared to scrambled and vector controls); †P < 0.05,
††P < 0.01 (compared to RB1 shRNA plus SCR (a), or to RB1 shRNA plus BN
vector (b)). Results represent at least two independent experiments. Values and
error bars denote mean and s.d. of triplicate assays; P values are from
unpaired Student’s t-test.

Figure 4 | Rb-depleted or Rb/p130-depleted cone precursor tumours.
a, Haematoxylin-and-eosin-stained Rb-depleted and Rb/p130-depleted cone
xenograft tumours and human retinoblastoma (n = 4). KD, knockdown.
Dashed lines denote Flexner-Wintersteiner rosettes; solid lines denote
fleurettes; asterisks mark rosette cavities. b, Cone- and cell-cycle-related protein
expression in human retinoblastoma and cone xenografts (n = 6). Scale bars,
40 µm (a, b). c, Transmission electron microscopy of Flexner-Wintersteiner
rosettes in a human retinoblastoma and a cone-derived tumour, with
mitochondria (arrows) between nuclei and rosette cavity (n = 2). The ×25,000
images are from the boxed area (top) or from a rosette not shown (bottom).
Results are representative of at least two experiments. d, Model of
cone-precursor retinoblastoma origin highlighting proteins that suppressed
(blue) or promoted (red) the proliferative response.

After several months, some cone precursor cultures depleted in Rb
or in both Rb and p130 (Rb/p130-depleted) formed suspension aggregates
resembling retinoblastoma cells (Extended Data Fig. 7a). Rb/p130-depleted
cultures proliferated more robustly and longer than those with
Rb depletion alone, consistent with p130 losses in many retinoblastoma
cell lines (Extended Data Fig. 6a). The cultures had properties consistent
with Rb/p130-depleted cone precursors (Extended Data Fig. 7b-h). When
engrafted either 3 months or within 1 week after knockdown, Rb- or
Rb/p130-depleted cone precursors formed retinoblastoma-like tumours
in subretinal xenografts (Fig. 4a and Extended Data Figs 8 and 9). For
cells engrafted within 1 week, tumours appeared within 6-14 months
(Extended Data Fig. 8b), similar to the time needed to form tumours
in children.

Cone-precursor-derived tumours had differentiated histology, little
Rb or p130, many Ki67+ cells, and prominent p107 and SKP2, consistent
with robust proliferation (Fig. 4a, b and Extended Data Fig. 9). They
expressed the photoreceptor-related CRX, CD133 and IRBP and the
cone-specific cone arrestin, L/M-opsin and RXRγ, all at levels similar to
retinoblastomas and developing retinas (Fig. 4b and Extended Data Fig. 9).
This is consistent with the many cone-specific proteins in retinoblastoma
tumours (Supplementary Table 1). As in human retinoblastomas,
cone-precursor-derived tumour cells lacked numerous markers of other
retinal cell types and had rare S-opsin and rhodopsin expression (Extended
Data Fig. 10). Rb-depleted and Rb/p130-depleted cone precursor
tumours also had structures resembling Flexner-Wintersteiner rosettes
and fleurettes (Fig. 4a), which are retinoblastoma hallmarks. Transmission
electron microscopy confirmed the rosettes, with mitochondria
positioned between the nuclei and rosette lumens (Fig. 4c). Dense
core vesicles were not seen in two Rb-depleted cone precursor tumours
nor in two retinoblastomas, consistent with the reported rarity of such
structures. Finally, single nucleotide polymorphism (SNP)-array analyses
of two tumours revealed no megabase-size gains or losses, whereas
quantitative PCR (qPCR) analyses revealed a partial RB1 loss but no
other frequently reported changes (Extended Data Fig. 8c-e), consistent
with the lack of DNA copy number alterations in some retinoblastomas.
Thus, cone precursor tumours resembled human retinoblastomas at
the histological, ultrastructural, retinal marker and molecular
cytogenetic levels.

This study examined collaboration between Rb loss and retinal
cell-type-specific circuitries. We found that the circuitry of maturing
L/M-cone precursors was uniquely conducive to proliferation and development
of retinoblastoma-like tumours. Although we cannot exclude the possibility
that Rb loss could induce a cone program and proliferation in
other cell types, the robust responses of the most highly enriched cone
precursor populations and of cells in an intact fovea suggest that cone
precursors are the primary if not the sole responding cell type. Cone
precursor features that collaborated with Rb loss included cone lineage
factors (TRβ2 and RXRγ), highly expressed oncoproteins (MYCN and
MDM2), and p27 downregulation probably mediated by SKP2. Some
of these features may be interdependent, as RXRγ promoted MDM2
expression, yet the larger program encompassing these features and its
developmental purpose are unknown. Importantly, Rb-depleted cone
precursor tumours had differentiated histology and lacked gross DNA
aberrations, similar to putative early retinoblastoma elements. These
findings support a model in which Rb-deficient cone precursors form
differentiated retinoblastomas, then dedifferentiate (Fig. 4d) and possibly
acquire non-cone features. Much of the circuitry implicated in
cone precursor tumour initiation was also needed for retinoblastoma
cell proliferation, suggesting that tumour cells can be addicted to the
cancer-predisposing circuitry of their originating cell types.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper.

METHODS


Retinoblastoma and retinal cell culture. Retinoblastoma cell lines Y79 and Weri- 
RB1 were obtained from the ATCC. RB177 was from an early passage culture and 
its identity confirmed by RB1 mutation sequencing. Retinoblastoma cells were
confirmed free of mycoplasma and cultured in RB culture medium (IMDM, 10% FBS,
55 µM β-mercaptoethanol, with glutamine, penicillin, streptomycin, fungizone and
10 µg ml⁻¹ plasmocin (Invivogen)). Fetal eyes were obtained with informed
consent from the Human Fetal Tissue Repository of the Albert Einstein College
of Medicine and from Advanced Bioscience Resources under protocols approved by the
Memorial Sloan-Kettering Cancer Center (MSKCC) Institutional Review Board, 
the Albert Einstein College of Medicine Institutional Review Board, and the Children’s 
Hospital Los Angeles Committee on Clinical Investigations. After transport in IMDM 
with 10% FBS on ice, eyes were rinsed in 70% ethanol for 3 s and washed in sterile PBS. 
Eyes were opened using a sterile scalpel and lens removed. Retinas were detached using 
forceps and incubated in papain solution (Worthington Tissue Dissociation Kit) for 
10-30 min at 37 °C and 5% CO2, with pipette mixing every 5 min. After dissociation
to ~20-cell clusters, cells were diluted with 10 volumes of PBS and collected by cen- 
trifugation at 2,000 r.p.m. (Sorvall, Legend RT), re-washed in PBS (all centrifuga- 
tions at 2,000 r.p.m. unless otherwise stated), suspended in RB culture medium as 
above, incubated at 37 °C with 5% CO2 overnight, and frozen in RB culture medium
containing 10% dimethylsulphoxide (DMSO). Supernatant was transferred into a 
sterile container after each centrifugation and re-spun to prevent retinal cell losses. 
For lentivirus infections, cells were recovered from liquid nitrogen, cultured over- 
night, washed with PBS, suspended in 0.05% trypsin/EDTA (Cellgro) for 3-10 min 
with gentle pipetting, re-centrifuged, suspended in RB culture medium as above, and 
immediately infected. Cultures were maintained at high density, typically 50,000 
cells per well (24-well dish) for unsorted cultures, with media changes every 3 days. 

FACS. Approximately 10 million dissociated retinal cells (~5 million per retina) were
cultured for 18 h after thawing in RB culture medium, collected by centrifugation,
washed with PBS, digested with 5 ml warm 0.05% trypsin/EDTA for 5-15 min while
triturating in a 24-well culture plate 20-30 times per minute using a 1,000-µl tip
and checking every ~3 min, to produce 90-95% single cells, centrifuged as above
(retaining the supernatant to prevent cell loss), suspended in 400 µl 5% FBS in PBS,
and incubated at room temperature for 10 min. Cells (100 µl) were combined with
100 µl 4 µg ml⁻¹ of mouse IgG (Sigma, I-8765), and 300 µl of cells were combined with
300 µl of pre-mixed anti-CD133-phycoerythrin (PE) (Miltenyi Biotec, 130-080-801)
at 1:6 and anti-CD44 fluorescein isothiocyanate (FITC) (clone IM7, Abcam ab19622
or BD Biosciences, BDB553133) at 1:25, to give 1:12 CD133 and 1:50 CD44 final
dilutions. After 1 h at room temperature, cells were diluted with 900 µl 5% FBS in
PBS, centrifuged as above, suspended in 500 µl 5% FBS/PBS with 300 ng ml⁻¹ DAPI
and held on ice until sorting. Cells were sorted using a Becton-Dickinson FACSAria
SORP with 100 mW 488-nm laser, the triple bandpass filter removed in the FITC
channel, FACSDiva v8.0 software, and selecting live single cells based on FSC width,
SSC width, and DAPI exclusion. On FSC/SSC plots, cells were divided into small,
medium, and large size groups and evaluated for CD133-PE and CD44-FITC. Eight
populations collected into 500 µl complete medium as above were small, medium and
large CD133hi, CD44−; small, medium, and large CD133lo, CD44+; small CD133−,
CD44−; and ungated live single cells. Each population was cultured in 50%
Y79-conditioned medium with fungizone (50% fresh RB culture medium combined
with 50% filtered Y79-conditioned RB culture medium), and half of the volume
changed with fresh 50% Y79-conditioned medium every 3 days. Sorted populations
were characterized by adhering cells to poly-L-lysine-coated coverslips
(1,000-2,000 cells each) for 3 h, fixing in 4% paraformaldehyde (PFA) for 5 min,
washing in PBS four times, and storing at −20 °C until immunostaining. Lentivirus
infection was performed within 24 h after sorting.
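
The final antibody dilutions quoted in the FACS protocol follow from mixing equal volumes of cell suspension and pre-mixed antibody. The arithmetic can be checked with a minimal sketch; the function name `final_dilution` is hypothetical and is not part of the published protocol.

```python
from fractions import Fraction


def final_dilution(stock_dilution: int, antibody_volume_ul: int, cell_volume_ul: int) -> Fraction:
    """Return the final antibody dilution as a fraction (for example 1/12).

    stock_dilution: the pre-mix dilution written as 1:stock_dilution.
    antibody_volume_ul / cell_volume_ul: volumes combined, in microlitres.
    """
    total_volume = antibody_volume_ul + cell_volume_ul
    return Fraction(1, stock_dilution) * Fraction(antibody_volume_ul, total_volume)


# Volumes and pre-mix dilutions as stated in the FACS paragraph.
cd133 = final_dilution(6, 300, 300)
cd44 = final_dilution(25, 300, 300)
print(f"CD133-PE final dilution: 1:{cd133.denominator}")
print(f"CD44-FITC final dilution: 1:{cd44.denominator}")
```

Because the antibody pre-mix makes up half of the final volume, each pre-mix dilution is simply halved.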

Lentiviral shRNA and cDNA expression constructs. pLKO lentiviral shRNA
vectors from the TRC library (Open Biosystems/Thermo Scientific or MSKCC SKI
High-Throughput Drug Screening and RNAi Core Facility) were designated by
‘sh’ followed by the name of the target gene and last 3-4 digits of the TRC or SKI
identification numbers (Supplementary Table 2). shRNAs directed specifically against
THRB variant 2 (also known as TRβ2) were designed using Invitrogen BLOCK-iT
RNAi Designer (http://rnaidesigner.invitrogen.com/rnaiexpress/) and siDirect
(http://sidirect2.rnai.jp/doc/) and cloned using the TRC cloning strategy
(http://www.addgene.org/pgvec1?f=v&cmd=showfile&file=protocols) with
deoxyoligonucleotides for DNA-directed RNAi (Integrated DNA Technologies).
They are designated according to the position of the first shRNA target
nucleotide after the translation initiation site (Supplementary Table 2).
The pLKO scrambled control was Addgene plasmid 1864 (ref. 28).
pLKO-YFP-shRB1-733, pLKO-YFP-shRB1-737 and pLKO-YFP-SCR control virus
were produced by replacement of the puromycin resistance gene with YFP
complementary DNA using In-Fusion cloning (Clontech), and provided by
Z. Li. The lentiviral cDNA expression BN vector was created by replacing
the EGFP gene of the BE-GFP lentiviral vector with the neomycin resistance gene
between the EcoRI and BamHI sites (with assistance of S. Puranik). BN-p130 was
produced by inserting human p130 cDNA between the BE-Neo PshAI and XbaI
sites. BN-SKP2 and BN-p107 were produced by inserting human SKP2 and p107
cDNA, respectively, between the BsiWI and PspXI sites of BE-Neo. Because
shRBL1-2621 (shp107-1) targets the 3’ untranslated region, only the RBL1 open
reading frame was cloned into BE-Neo to produce shRBL1-2621-resistant BN-p107.
To produce shRBL1-2623 (shp107-2)-resistant BN-p107-2r, the shRBL1 target
sequence GCAGTGAATAAGGAGTATGAA was mutated to GCAGTAAACAAAGAATATGAA
without amino acid sequence changes using In-Fusion cloning (Clontech).
BE-p27 was as described.

Lentivirus production and infections. Lentiviruses were produced by reverse
transfection of suspended 2 × 10’ 293T cells using 20 µg lentiviral vector, 10 µg
pVSV-G, 20 µg pCMV-dR8.91 (ref. 30) and 100 µl Polyjet (SignaGen) or
Lipofectamine 2000 (Life Technologies) in 15 cm? dishes. The 3-ml plasmids-Polyjet
complex and 1.5 ml 293T cell suspension were mixed in 50 ml centrifuge tubes and
shaken for 30 min before being transferred to dishes. Virus collected 48 and 72 h
after transfection was combined, concentrated 50-100-fold by centrifugation at
25,000 r.p.m. for 90 min, and suspended in RB culture medium. Concentrated virus
(500-2,000 µl) was used to infect 5 × 10° Y79, Weri-1 or RB177 retinoblastoma cells,
or to infect 5 × 10° total retinal cells or 1 × 10° of each sorted retinal cell population
in 500 µl of filtered conditioned RB culture medium in the presence of 4 µg ml⁻¹
polybrene (Sigma-Aldrich) followed by gentle pipetting 25 times and shaking for
10 min in the hood. After 18 h, cells were diluted in an equal volume of conditioned
RB culture medium and maintained at 37 °C with 5% CO2. For co-infections, 100 µl
of each concentrated virus was used to infect either 1 × 10* total retinal cells or 1 × 10°
sorted retinal cells suspended in 100 µl of conditioned RB culture medium with
4 µg ml⁻¹ polybrene in a total volume of 300 µl, and medium was replaced with 150 µl
50% Y79 and other RB cell conditioned medium 24 h after infection. Infected cells
were selected starting 48 h after infection with 1.4-3 µg ml⁻¹ puromycin for
48-72 h or with 50-100 µg ml⁻¹ G418 for 4-7 days, and fed every 2-3 days by replacing
two-thirds of the media with 50% Y79 and other RB cell-conditioned medium.
Intact FW19 retinas were either infected within the globe, by cutting a cross-section
through the cornea, removing the lens and most of the vitreous, and pipetting 500 µl
of concentrated pLKO versions of shRB1-733 and shRB1-737 or scrambled control
lentivirus into the sub-retinal space and vitreous (causing retinal detachment) in a
24-well plate with the globe submerged in RB culture medium with 1 ml lentivirus
suspension, followed after 2 days by addition of 2 ml of freshly prepared
concentrated lentivirus; or infected after removal of the intact retina and residual
vitreous in a 12-well plate, by addition of 1 ml of 80× concentrated pLKO-YFP-shRB1-733
or scrambled control lentivirus, reinfection with the same viruses 1 and 3 days later,
and changing 50% of medium with a 1:1 mixture of fresh and ocular globe-conditioned
medium daily thereafter. Displaced retinal tissue was fixed with 2% PFA in PBS for
2 h at 4 °C, and eyes with remaining tissue were fixed in 2% PFA in PBS overnight
at 4 °C. Tissue samples were washed with PBS, transferred to 30% sucrose in PBS,
and embedded in 30% sucrose/PBS:OCT at a 2:1 ratio, and cryosectioned at 8-10 µm.
Real-time quantitative PCR. Total RNA was isolated using StrataPrep total RNA 
microprep kit (Stratagene) for <1,000 cells (in FACS isolated populations) or GenElute 
Mammalian Total RNA Miniprep Kit (Sigma) for all other analyses. cDNA was 
synthesized using ImProm-II Reverse Transcription System (Promega). Primers 
were designed by Beacon Designer software (Premier Biosoft International) or Primer3 
(http://frodo.wi.mit.edu/primer3/) (Supplementary Table 3). Relative messenger 
RNA levels were determined by qPCR using QuantiTect SYBR Green PCR Kit 
(Qiagen) or Maxima SYBR Green qPCR Master Mix (Fermentas) on an Applied 
Biosystems ABI 7900HT Sequence Detection System or ViiA 7 Real-Time PCR 
System using 95 °C 10 min followed by 40 cycles of 95 °C 20 s, 54 °C 30 s, 72 °C 30 s.
Each sample was evaluated in triplicate and normalized to ACTB and GAPDH. 
Values represent the averages of both normalized results and error bars the stan- 
dard deviation. 
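
The normalization described in the qPCR paragraph can be illustrated with a minimal sketch. It assumes a comparative 2^(−ΔCt) calculation with roughly 100% amplification efficiency, which the text does not state, and it takes one reading of "the averages of both normalized results": the target is normalized to ACTB and to GAPDH separately, and the two values are then averaged. The function names and Ct values are hypothetical.

```python
from statistics import mean, stdev


def relative_level(target_ct: list[float], reference_ct: list[float]) -> float:
    """Relative target level versus one reference gene as 2^-(dCt).

    Uses the mean Ct of the technical triplicates and assumes that product
    doubles every cycle (an assumption, not stated in the Methods).
    """
    return 2 ** -(mean(target_ct) - mean(reference_ct))


def normalized_to_both(target_ct, actb_ct, gapdh_ct):
    """Normalize to ACTB and GAPDH separately; return (average, s.d.)."""
    values = [relative_level(target_ct, actb_ct),
              relative_level(target_ct, gapdh_ct)]
    return mean(values), stdev(values)


# Hypothetical Ct triplicates, for illustration only (not data from this study).
target = [24.1, 24.0, 23.9]
actb = [18.0, 18.1, 17.9]
gapdh = [19.0, 19.1, 18.9]
average, sd = normalized_to_both(target, actb, gapdh)
print(f"relative mRNA level = {average:.4f} (s.d. {sd:.4f})")
```

Comparing knockdown and control samples then amounts to taking the ratio of their averaged relative levels.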

Immunostaining. Antibodies are described in Supplementary Table 4. Eyes were
prepared and cryosectioned as described. Cultured retinal cells were dissociated
by gentle triturating, spread on poly-L-lysine-coated slides, incubated in a
humidified incubator at 5% CO2 and 37 °C for 3 h, fixed in 4% PFA in PBS for 5 min,
gently rinsed with PBS four times, vacuum-dried for 5 min, and stored at −20 °C.
The following co-staining combinations and orders were used to assess Ki67
expression in different retinal cell types. For cones: 1a. Mouse anti-cone arrestin,
anti-mouse-biotin, streptavidin-FITC, rabbit anti-Ki67, anti-rabbit-Cy3, rabbit anti-CRX
and anti-rabbit-Cy5. 1b. Mouse anti-Ki67, anti-mouse-Cy3, rabbit anti-CRX,
anti-rabbit-FITC, rabbit anti-human cone arrestin and anti-rabbit-Cy5. 2. Mouse anti-Ki67,
anti-mouse-biotin, streptavidin-FITC, rabbit anti-CRX, anti-rabbit-Cy3, rabbit
anti-L/M-opsin and anti-rabbit-Cy5. 3. Mouse anti-RXRγ, anti-mouse-biotin,
streptavidin-FITC, rabbit anti-Ki67, anti-rabbit-Cy3, rabbit anti-CRX and anti-rabbit-Cy5. For
progenitors, Müller, and horizontal amacrine cells: mouse anti-Pax6,
anti-mouse-biotin, streptavidin-FITC, rabbit anti-Ki67, anti-rabbit-Cy3, rabbit anti-nestin and
anti-rabbit-Cy5. For other retinal cell types: mouse anti-human Ki67, anti-mouse-biotin,
streptavidin-FITC, and rabbit antibodies for retinal specific markers, anti-rabbit-Cy3.
For BrdU labelling, 10 µM BrdU was added to medium for 2 h on day 23 after
Rb knockdown and cells were stained with rat anti-BrdU, anti-rat-FITC, rabbit
anti-CRX, anti-rabbit-Cy3, rabbit anti-L/M-opsin and anti-rabbit-Cy5.

For co-staining with mouse antibodies, sections or cells were treated with 1 mM 
EDTA/PBS for 5 min at room temperature and washed with PBS. Sections were 
treated with ABC kit reagent A (Vector Laboratories) in PBS for 15 min, washed in 
PBS, treated with ABC kit reagent B (Vector Laboratories) in PBS for 15 min, washed 
in PBS, blocked and permeabilized for 20 min in super block (2.5% horse serum, 
2.5% donkey serum, 2.5% human serum, 1% BSA, 0.1% Triton-X-100 and 0.05% 
Tween-20 in PBS; filtered with 0.22 µm filter), incubated in mouse primary
antibody in super block overnight at 4 °C, washed in PBS, incubated in biotinylated
horse anti-mouse antibody in super block for 30 min, washed in PBS, incubated 
with FITC-conjugated streptavidin in PBS, and washed with PBS. 

For co-staining with antibodies of other species, after completing the first stain- 
ing reaction as above, sections were incubated in super block for 20 min, incubated 
overnight with primary antibody in super block, washed in PBS, incubated with Cy3- 
or Cy5-conjugated secondary antibody in super block for 30 min, and washed in 
PBS. Sections were then stained with 1 µg ml⁻¹ DAPI in PBS, dried, mounted in
Vectashield (Vector Labs) and imaged using an Axioplan2 (Carl Zeiss MicroImaging,
LLC) or confocal DMIRE2 (Leica) microscope. Antibody specificity was confirmed 
by staining in parallel with control IgG or no primary antibody. 

Antibody-dependent immunofluorescence signals were distinguished from
autofluorescence by virtue of signal detected in only one colour channel. Cells with
autofluorescence in multiple channels or with DNA condensation, fragmentation, or
degradation were excluded. Cytoplasmic autofluorescence common in astrocytes,
Müller glia, and ganglion cells was distinguished from authentic antigens by its
detection at multiple wavelengths in cells with characteristic glial cytoplasmic and 
nuclear morphology. Nonspecific cytoplasmic staining by concentrated nestin anti- 
body was distinguished from authentic nestin staining by its homogeneous rather 
than fibre-like structure. Nonspecific nuclear staining of L/M-opsin was avoided by 
using reduced antibody concentration. 
EdU labelling and detection. Click-iT EdU Alexa Fluor 488 Imaging Kit was used 
for EdU labelling to detect proliferation. Dissociated or sorted retinal cells were 
infected with shRB1 or control lentivirus. After 14 days, 20 µg ml⁻¹ of EdU was added
into medium and incubated for 1 h; the cells were attached on poly-L-lysine-coated
coverslips for 2 h and fixed for 5 min. The cells were blocked and permeabilized
for 20 min in super block as above and EdU was detected by addition of Click-iT
reaction cocktails containing 2 µM Alexa Fluor 488 azide for 1 h. Co-staining was
performed after EdU labelling with different combinations of antibodies for
retinal cell markers and secondary antibodies
conjugated with Cy3 or Cy5, described as above. For cones, cone arrestin-Cy3 plus 
CRX-Cy5, RXRy-Cy3 plus CRX-Cy5, and L/M-opsin-Cy3 plus CRX-Cy5 were 
used for co-staining with EdU. 
Immunoblotting. Cells were washed in PBS, lysed in ELB+ (150 mM NaCl, 50 mM 
HEPES, pH 7.4, 0.1% NP40, 5 mM EDTA, 2 mM dithiothreitol (DTT), 1 mM
phenylmethylsulfonyl fluoride, 10 mM NaF, 1 mM sodium orthovanadate, Thermo
Scientific Halt phosphatase inhibitor cocktail and protease inhibitor cocktails), sep- 
arated on 4-20% Ready Gel polyacrylamide gels (Jule Biotechnologies INC) or 8% 
polyacrylamide (for Rb western), and transferred to Hybond-ECL nitrocellulose 
membrane (Amersham). Membranes were probed with antibodies (Supplementary 
Table 4) and developed using horseradish peroxidase-conjugated anti-mouse or 
anti-rabbit antibodies and the ECL Advance Western Blotting Detection Kit (Amer- 
sham Biosciences) or Thermo Scientific SuperSignal-West Femto Chemilumines- 
cent Substrate, and HyBlot CL X-Ray film (Denville Scientific). 


Xenografts. All animal experiments complied with ethical regulations and were 
approved by the MSKCC Institutional Animal Care and Use Committee. Xenografts 
were performed on 6-week-old male athymic (Foxn1−/−) mice (Taconic) or
6-week-old male non-obese diabetic-severe combined immunodeficient (NOD-SCID)
IL2rg−/− mice (Jackson Laboratories). Cultured cells were dissociated by pipetting,
suspended in RB growth medium at 5 × 10* cells per microlitre (day 90) or 2.5 × 10°
cells per microlitre (days 3 or 7), held on ice, and 2 µl injected into the subretinal space
as described. Irradiated 5053 rodent diet with amoxicillin was provided from 2 days
before to 2 weeks after injection to prevent infection. Some tumour-bearing eyes
were fixed and embedded as described.

DNA copy number analyses. Genomic DNA from retinoblastomas, cone-derived 
retinoblastoma-like cells, and cone-derived xenograft tumours were isolated with 
QiaAMP DNA Mini kit (Qiagen). Genomic DNA of cone-derived cells was digested 
with XhoI to separate pLKO DNA hairpin structures. Relative DNA levels were 
determined in triplicate by qPCR using QuantiTect SYBR Green PCR Kit (Qiagen) 
on an Applied Biosystems ABI 7900HT Sequence Detection System or ViiA 7 Real- 
Time PCR System, using primers listed in Supplementary Table 5 and normalizing 
to the average of the HNF4A and BRCA1 genes. Integrated pLKO-shRB1-733, 
shRB1-737 and shRBL2-923 copy numbers were analysed using primers corresponding 
to the pLKO.1 U6 promoter and RB1- or RBL2-specific shRNA sequences. High 
resolution SNP-array DNA copy number analyses were performed using CytoScan 
HD (Affymetrix, 901835). Data were analysed using Chromosome Analysis Suite 
2.0 (Affymetrix). 
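
The qPCR normalization described above can be sketched as follows. The text states only that relative DNA levels were normalized to the average of HNF4A and BRCA1; the 2^-ΔΔCt calculation, the assumption of roughly 100% amplification efficiency, the two-copy normal reference and all Ct values below are illustrative assumptions, not the authors' procedure or data.

```python
from statistics import mean


def delta_ct(target_cts, reference_cts_by_gene):
    """Mean target Ct minus the average of the per-gene mean reference Cts."""
    reference_mean = mean([mean(cts) for cts in reference_cts_by_gene.values()])
    return mean(target_cts) - reference_mean


def relative_copy_number(sample, normal, normal_copies=2.0):
    """Estimate copy number by 2^-ddCt (assumed method, ~100% PCR efficiency)."""
    dd_ct = delta_ct(*sample) - delta_ct(*normal)
    return normal_copies * 2.0 ** (-dd_ct)


# Hypothetical triplicate Ct values (illustrative only, not data from the study).
references = {"HNF4A": [24.0, 24.0, 24.0], "BRCA1": [26.0, 26.0, 26.0]}
tumour = ([25.0, 25.25, 24.75], references)
normal = ([24.0, 24.25, 23.75], references)

copies = relative_copy_number(tumour, normal)
print(f"estimated copy number: {copies:.2f}")
```

With these made-up values the target amplifies one cycle later in the tumour than in normal retina, which corresponds to about one copy instead of two.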

Statistical analyses. Measurements were performed in triplicate and differences 
between means assessed for significance using unpaired Student’s t-tests. Sample 
sizes were chosen based on the maximum cell numbers that could be used for indi- 
vidual experiments given sample availability. 
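
As a minimal sketch of the comparison described above, an unpaired (pooled-variance) Student's t statistic for two triplicate measurements can be computed with the standard library alone. The input values are hypothetical, and the significance call uses tabulated two-sided critical values for 4 degrees of freedom rather than an exact P value.

```python
from math import sqrt
from statistics import mean, variance

# Two-sided critical values of Student's t for 4 degrees of freedom (3 + 3 - 2).
T_CRIT_DF4 = {0.05: 2.776, 0.01: 4.604}


def unpaired_t(a, b):
    """Return (t, df) for an unpaired Student's t-test with pooled variance."""
    n_a, n_b = len(a), len(b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(a) + (n_b - 1) * variance(b)) / df
    se = sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(a) - mean(b)) / se, df


def stars(t):
    """Map |t| (df = 4 only) onto the '*' / '**' notation used in the legends."""
    if abs(t) >= T_CRIT_DF4[0.01]:
        return "**"
    if abs(t) >= T_CRIT_DF4[0.05]:
        return "*"
    return "ns"


# Hypothetical triplicate measurements (illustrative only).
knockdown = [10.0, 12.0, 14.0]
control = [4.0, 6.0, 8.0]
t_stat, df = unpaired_t(knockdown, control)
print(f"t = {t_stat:.3f}, df = {df}, {stars(t_stat)}")
```

The critical-value lookup is only valid for triplicate-versus-triplicate comparisons; other sample sizes would need the matching row of a t table or a statistics library.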

Transmission electron microscopy. Human retinoblastomas and cone-derived 
xenograft tumours were fixed with 4% PFA in PBS, rinsed in 0.1 M sodium caco- 
dylate buffer, post-fixed in 2% osmium tetroxide for 1h, rinsed in distilled water, 
dehydrated in a graded series of 50%, 75%, 95% and 100% ethanol, followed by two 
10-min incubations in propylene oxide and overnight incubation in 1:1 propylene 
oxide/Poly Bed 812. The samples were embedded in Poly Bed 812 and cured at 
60 °C. Ultra-thin sections were obtained with a Reichert Ultracut S microtome. 
Sections were stained with uranyl acetate and lead citrate and photographed using 
a Jeol 1200EX transmission electron microscope. 
Extended Data Figure 1 | Proliferation of cone-like cells after Rb depletion 
in dissociated FW19 retina. a, Decreased Rb protein in L/M-opsin+ or TRβ2+ 
cells (arrows) on days 5 or 23, and decreased RB1 RNA or Rb protein on 
day 4 after shRB1-733 transduction. b, Cone arrestin+, CRX+ cells (arrows) 
with or without Ki67 co-expression. c, Ki67+ and cone arrestin+ cells first 
detected 9 or 14 days after transduction in two experiments. d–f, Co-staining 
of Ki67 with RXRγ and CRX at 14 days (d), with cone arrestin and CRX at 
14 days (e), or with L/M-opsin and CRX at 23 days (f) after transduction with 
shRB1-733 or a scrambled control. g, Percentage of cells co-expressing Ki67 
with L/M-opsin and CRX, RXRγ and CRX, or cone arrestin and CRX, 23 days 
after transduction. h, Prevalence of cells co-staining for L/M-opsin and CRX, 
RXRγ and CRX, or cone arrestin and CRX, 23 days after transduction. 
i, Ki67 not detected in cells expressing markers of rods (NRL), ganglion cells 
(BRN-3), bipolar cells (strong CHX10), or horizontal cells (PROX1) 14 days 
after transduction. j, Co-expression of Ki67 with markers of RPCs (nestin, 
white arrows) or Müller glia (CRALBP or SOX2), but not in PAX6+, 
nestin− ganglion, amacrine or horizontal cells (yellow arrows) 14 days after 
transduction. k, l, EdU incorporation in cells expressing markers of cones (cone 
arrestin and CRX or RXRγ and CRX, yellow arrows in l) but not in cells 
expressing markers of rods (CNGA1, CNGB1), bipolar cells (CHX10, CRX), or 
ganglion, horizontal or amacrine cells (syntaxin) (white arrows in l) 14 days 
after transduction. Black lines above labels demarcate distinct fields. 
m, Co-staining of phosphohistone H3 (PH3) with cone arrestin and CRX 
23 days after transduction. n, Apoptosis marker CC3 in cells expressing RPC 
and glial marker nestin 14 days after transduction with RB1-directed shRNAs 
(yellow arrow) but not with scrambled control (white arrow). Values and error 
bars are mean and s.d. of triplicate assays. Scale bars, 20 µm. Data are 
representative of at least two independent experiments. 
Extended Data Figure 2 | FACS isolation of retinal cell populations. Retinal 
cells were isolated according to size, CD133 and CD44 staining. In study 1, cell 
type compositions in each fraction (a) were determined by immunostaining 
with cone arrestin and CRX (b, e), NRL (c), and nestin and PAX6 (d, f). In 
study 2, cell type compositions (i) were determined by immunostaining with 
RXRγ and CRX, nestin and CHX10, nestin and PAX6, and CRALBP (j, k). The 
percentages of the predominant cell types in each population (a, i) and marker 
specificities (g) are indicated. h, Cone-specific co-staining of cone arrestin and 
GNAT2 (top) and cone-specific co-staining of RXRγ and CRX (bottom) in 
FW19 retina. GCL, ganglion cell layer; INL, inner nuclear layer; ONL, outer 
nuclear layer. l, Co-staining of cells for EdU with cone arrestin and CRX or with 
RXRγ and CRX 14 days after transduction of the cone-enriched medium plus 
large CD133hi CD44− population isolated as in i–k with two RB1 shRNAs 
(yellow arrows) but not with the scrambled control (white arrows). In both 
studies, CD133hi CD44− medium and large size populations mainly consisted 
of cells expressing cone markers (CRX and cone arrestin, or CRX and RXRγ). 
The CD133hi CD44− small population mainly consisted of cells expressing a 
rod marker (NRL) with a variable proportion expressing cone markers. All 
CD133lo CD44+ populations mainly consisted of cells co-expressing RPC 
and glial markers (nestin and PAX6, nestin and CHX10, or CRALBP). The 
CD133lo CD44− small size population consisted of cells with diverse 
immunophenotypes. Values and error bars are mean and s.d. of triplicate 
assays. Scale bars, 30 µm. 

Extended Data Figure 3 | Cone precursor gene expression response to Rb 
depletion. a-c, Fold change in RNA level relative to day 0 uninfected cells 
for RB1 (a), or the indicated E2F-responsive genes (b), or the indicated 
p53-regulated genes (c), 3 and 6 days after transduction of each population with 
a mixture of shRB1-733 and shRB1-737, or with scrambled control. *P < 0.05, 
**P < 0.01 (comparing shRB1 and scrambled control). Data are representative 
of two sets of qPCR analyses. Values and error bars are mean and s.d. of 
triplicate assays. 
Extended Data Figure 4 | Proliferation status of retinal cells other 
than cones 15 days after shRB1 transduction of intact FW19 retinas. 
a-d, Combined transduction with pLKO-shRB1-733 and -737. a, Ki67 not 
detected in NRL+ or rhodopsin+ rod photoreceptors or in calbindin+ 
horizontal cells. b, Ki67 detected in PAX6+, nestin+ RPCs (white arrows) 
but not in PAX6+, nestin− horizontal, amacrine or ganglion cells (yellow 
arrows). c, Ki67 detected in CHX10+, CRX− RPCs (white arrows) but not 
in CHX10+, CRX+ bipolar cells (yellow arrows). d, Percentage of cells 
co-expressing Ki67 and retinal cell markers. e-h, Transduction with 
YFP-marked pLKO-YFP-shRB1-733. e, Ki67 detected in YFP+, L/M-opsin+ 
or YFP+, cone arrestin+ cone precursors (white arrows) and in an undefined 
YFP+ cell (yellow arrow). f, Ki67 not detected in YFP+, calbindin+ horizontal 
cells, YFP+, syntaxin+ or YFP+, PAX6+ amacrine cells, or in YFP+, NRL+ 
rod precursors. g, Ki67 detected (white arrows) or not detected (yellow arrows) 
in YFP+, nestin+ RPCs or glia, or in YFP+, CHX10+ RPCs or bipolar cells. 
h, Proportion of Ki67+ cells co-expressing YFP and retinal markers after 
transduction with pLKO-YFP-shRB1-733 or scrambled control. Values and 
error bars are mean and s.d. of triplicate assays. Scale bars, 20 µm. Analyses in 
a-d and in e-h represent two independent experiments. All immunostaining 
was performed at least twice. 
Extended Data Figure 5 | Effect of cone- and Rb-related circuitry on cone 
precursor response to Rb depletion. A, Percentage of Ki67+ cells among 
L/M-opsin+, CRX+ cells (a), among RXRγ+, CRX+ cells (b), or among cone 
arrestin+, CRX+ cells (c); and percentage of L/M-opsin+, CRX+ cells among 
all cells with DAPI+ nuclei (d) after transduction of dissociated FW18 retina 
with shRB1-733 and shRNAs against p130, p107, TRβ2, SKP2, MDM2 and 
MYCN. B, Percentage of Ki67+ cells among L/M-opsin+, CRX+ cone-like cells 
(top) and proliferative response (bottom) after transduction of dissociated 
FW18 retina with shRB1-733 and with shRNAs against RXRγ and p27 
(shRNAs 856+930), or with overexpression of p27 and p27-T187A. 

C, High-level Thr 187 phosphorylated p27 (p-p27(T187), top) coinciding 
with downregulation of total p27 (bottom) and prominent Rb during cone 
precursor maturation. C, a, Perifoveal region of FW18 retina. C, b, Enlarged 
view of boxed regions in C, a. Arrows, cone precursors identified by large, 
strongly Rb+ nuclei and lack of p27 signal in characteristic outer nuclear layer 
position”’*. D, Effect of two RBL1-p107 or two RBL2-p130 shRNAs on 
proliferation of Rb-depleted isolated cone precursors. E, Knockdown efficacy of 
two RBL1-p107 or two RBL2-p130 shRNAs in Y79 and RB177 retinoblastoma 
cells. F, Impaired proliferation of Weri-RB1 retinoblastoma cells after 
transduction with BN-p130 compared to vector control. G, Impaired 
proliferation of RB177 retinoblastoma cells following transduction with two 
p107 shRNAs. H, I, Impaired proliferation and MYCN expression in Y79 cells 
after p107 knockdown with two p107-directed shRNAs, and rescue by shRNA- 
resistant BN-p107 constructs. J, p27 accumulation and growth suppression 
following p107 knockdown with shp107-2 rescued by BN-p107-2r in RB1 wild 
type SKN-BE(2) neuroblastoma cells. p107 overexpression impaired SKN- 
BE(2) growth, contrary to its effects in Y79. *P < 0.05, **P < 0.01 (compared 
to SCR or vector control); †P < 0.05, ††P < 0.01 (compared to RB1-KD plus 
SCR or RB1-KD plus BN vector); ‡P < 0.05, ‡‡P < 0.01 (compared to 
shp107-2 plus BN vector) (H-J). Data are representative of more than two 
independent experiments except for SKN-BE(2) analyses. Values and error 
bars are mean and s.d. of triplicate assays. 
Extended Data Figure 6 | p130 copy number in retinoblastomas and cone 
precursor expression. a, DNA copy number of p130, other 16q genes 
implicated in retinoblastoma (CDH11, CDH13), and p107 determined by qPCR 
(n = 6). The percentage of retinoblastomas with copy number (CN) < 1.5 
was higher for p130 than for other 16q genes (summarized at right; P values 
relative to p130 using Fisher’s exact test). b, p130 in peripheral, lateral and 
central FW19 retina. Boxed region in maturing central retina (top) and 
enlarged view (bottom) show prominent p130 in weakly DAPI-stained cone 
precursor nuclei (arrows). Scale bars, 40 µm. Data are representative of at least 
two independent experiments. Values and error bars are mean and s.d. of 
triplicate assays. 
Extended Data Figure 7 | Characterization of Rb/p130-depleted 
retinoblastoma-like cells. a, Similar appearance of Rb/p130-depleted cones 
and early passage retinoblastoma cells. Scale bar, 40 µm. b, c, DNA copy 
number of shRNA vectors (b) or selected genes (c) in cell lines derived from 
Rb/p130-depleted cone precursors (Cone1, Cone2, Cone5) or from 
Rb/p130-depleted unsorted retinal cells (All3, All4), in Rb-depleted unsorted 
retinal cells 4 days after transduction (All-RB1-KD-d4), in Y79 cells, or in 
FW21 retina (normal) (n = 6). All cell lines retained RB1 and p130 shRNA 
vectors and lacked RB1 or p130 copy number alterations. The Y79 MYCN copy 
number (~78) is not shown (asterisk). d-g, qPCR gene expression analyses 
in the indicated cell lines relative to cones transduced with scrambled control 
or FW21 retina (n = 6). d, All cell lines had diminished RB1 and p130 
expression. e-g, Altered expression of cell-cycle-related (e), cone-related (f) 
and apoptosis-related (g) genes. h, SNP-array analysis of two Rb/p130-depleted 
cone precursor cell lines (1, 2), revealing no megabase-size loss of 
heterozygosity (LOH) or copy number alterations (CNA). Data are 
representative of at least two analyses (b-g) or analyses of two cell lines (h). 
Values and error bars are mean and s.d. of triplicate assays. 
Extended Data Figure 8 | Characterization of Rb- and Rb/p130-depleted 
cone precursor tumours. a, Intraocular tumour 4 months after Rb-depleted 
cone precursor xenograft. b, Summary of subretinal xenograft groups 1, 2 and 3. 
Sample size was as needed to assess tumour phenotypes. Mice were randomly 
assigned to different xenograft regimens and the investigator blinded to the 
assignment until the tumour analyses. Two mice with early death were 
excluded from the analyses. c, SNP-array analysis of one Rb/p130-depleted 
(tumour 1) or one Rb-depleted (tumour 2) cone-precursor-derived tumours 
from xenograft group 3, revealing no megabase-size loss of heterozygosity or 
copy number alterations. d, qPCR analysis of pLKO shRNA vector copy 
number in tumours derived from Rb/p130-depleted cone precursors 
(m-Cone1, m-Cone2) or from Rb/p130-depleted unsorted retinal cells (m-All3, 
m-All4), or in mouse ocular tissue (m-Cone-SCR), Y79 cells, or FW19 retina 
(normal). All tumours retained RB1 and/or p130 shRNA vector sequences, 
confirming their engineered cone precursor origin. e, qPCR analysis of MDM2, 
MDM4, RB1 and MYCN copy number in three cone-derived tumours and 
normal retina (n = 6). DNA copy number data (d, e) are representative of two 
analyses. Values and error bars are mean and s.d. of triplicate assays. 
Extended Data Figure 9 | Cone and cell-cycle-related proteins in Rb- or 
Rb/p130-depleted cone precursor tumours engrafted 3 days after 
transduction. Most tumour cells expressed human nuclear antigen (HuNU), 
confirming their xenograft origin. They also expressed cone-related 
proliferation-related proteins (Ki67, SKP2, p107 and cytoplasmic p27) but 
lacked Rb. Tumours had elements resembling Flexner-Wintersteiner rosettes 
(asterisks) and fleurettes (daggers). Scale bars, 40 µm. Data are representative 
of three independent experiments. 
Extended Data Figure 10 | Analysis of non-cone cell markers in cone- 
precursor-derived tumours and retinoblastomas. a, Proteins detected in 
normal retina but not in cone-derived tumour or human retinoblastoma cells 
included markers of rods (rhodopsin and CNGB1), RPCs and Müller glia 
(nestin, GFAP and PAX6), bipolar cells (CHX10), ganglion, amacrine and 
horizontal cells (calbindin and PAX6), and ganglion cells (nuclear BRN-3, thin 
arrows in mouse retina). PAX6+, nestin+ cells detected in human 
retinoblastoma were previously found to be Rb+ non-tumour cells from 
tumour-associated retina’. An uncharacterized cytoplasmic BRN-3 signal (bold 
arrows) was detected in mouse photoreceptor outer segments and in cone- 
derived tumour and retinoblastoma rosettes. b, L/M-opsin was detected in 
most cone-derived tumour cells. However, rare cells co-expressed S-opsin and 
L/M-opsin (arrows), as in immature L/M-cone precursors and human 
retinoblastomas’. c, One tumour had rare rhodopsin+, Ki67− cells but no 
detected rhodopsin+, Ki67+ cells, as in previously characterized retinoma- 
like regions’. Scale bars, 40 µm. Data are representative of three independent 
xenograft experiments. 
Noncoding RNA transcription targets AID to 
divergently transcribed loci in B cells 


Evangelos Pefanis'*, Jiguang Wang'**, Gerson Rothschild'*, Junghyun Lim’, Jaime Chao', Raul Rabadan’*, Aris N. Economides? 
The vast majority of the mammalian genome has the potential to 
express noncoding RNA (ncRNA). The 11-subunit RNA exosome com- 
plex is the main source of cellular 3′–5′ exoribonucleolytic activity 
and potentially regulates the mammalian noncoding transcriptome’. 
Here we generated a mouse model in which the essential subunit 
Exosc3 of the RNA exosome complex can be conditionally deleted. 
Exosc3-deficient B cells lack the ability to undergo normal levels of 
class switch recombination and somatic hypermutation, two muta- 
genic DNA processes used to generate antibody diversity via the B-cell 
mutator protein activation-induced cytidine deaminase (AID)”’. The 
transcriptome of Exosc3-deficient B cells has revealed the presence of 
many novel RNA exosome substrate ncRNAs. RNA exosome substrate 
RNAs include xTSS-RNAs, transcription start site (TSS)-associated 
antisense transcripts that can exceed 500 base pairs in length and are 
transcribed divergently from cognate coding gene transcripts. xTSS- 
RNAs are most strongly expressed at genes that accumulate AID- 
mediated somatic mutations and/or are frequent translocation partners 
of DNA double-strand breaks generated at Igh in B cells*’. Strikingly, 
translocations near TSSs or within gene bodies occur over regions 
of RNA exosome substrate ncRNA expression. These RNA exosome- 
regulated, antisense-transcribed regions of the B-cell genome recruit 
AID and accumulate single-strand DNA structures containing RNA- 
DNA hybrids. We propose that RNA exosome regulation of ncRNA 
recruits AID to single-strand DNA-forming sites of antisense and 
divergent transcription in the B-cell genome, thereby creating a link 
between ncRNA transcription and overall maintenance of B-cell 
genomic integrity. 

AID mutates single-strand DNA (ssDNA) substrates that form during 
transcription across the B-cell genome. Current DNA targeting models 
propose that AID binds paused/stalled RNA polymerase II complexes 
(RNA Pol II) to access target DNA®. In turn, RNA Pol II associates with 
the pausing/stalling cofactors Spt5 and RNA exosome, both of which 
stimulate AID function in B cells”. Since RNA exosome is a functional 
component of the stalled RNA Pol II'*"' targeting platform of AID, we 
evaluated RNA exosome’s role in regulating AID activity genome-wide. 
Accordingly, we developed a mouse model containing a conditional inver- 
sion (COIN)” allele of Exosc3, allowing conditional ablation of RNA 
exosome function using tissue-specific or inducible Cre recombinase 
alleles (Fig. 1a). Cre-mediated ablation of Exosc3 with this allele leads 
to concomitant green fluorescent protein (GFP) reporter induction from 
the Exosc3 locus (details in Methods and Extended Data Fig. 1). B cells 
were generated from Exosc3COIN/+ and Exosc3COIN/COIN mice on the 
4-hydroxytamoxifen (4-OHT)-inducible ROSA26CreERt2/+ background. 
4-OHT treatment of these cells produced robust Exosc3 gene inversion, 
loss of Exosc3 messenger RNA and protein, and induction of GFP (Fig. 1b, c 
and Extended Data Fig. 1d-f). 

Figure 1 | Exosc3-deficient B cells are defective in immunoglobulin 
diversification. a, Exosc3COIN allele and conversion to Exosc3-0™”, Cre- 
mediated inversion of lox2372 pair (red triangles) and subsequent deletion via 
loxP pair (violet triangles). GFP-expressing terminal exon is represented by 
green arrow. SA, splice acceptor. b, Southern blot of HindIII-digested genomic 
DNA from 4-OHT-treated (days 2-4), lipopolysaccharide (LPS) plus 
interleukin (IL)-4 stimulated B cells. Probe specific for Exosc3 exon 3. WT, wild 
type. c, qRT-PCR time course of Exosc3 mRNA expression in 4-OHT-treated 
(days 2-4), LPS plus IL-4 stimulated B cells. Indicated Exosc3 genotypes on a 
ROSA26CreERt2/+ background. Expression levels normalized to cyclophilin 
(Ppia) and plotted relative to untreated Exosc3©°"’". Three technical 
replicates, error bars represent standard deviation (s.d.). d, IgG1 CSR efficiency 
in 4-OHT-treated Exosc3COIN/+ and Exosc3COIN/COIN B cells after 72 h of LPS 
plus IL-4 stimulation. Indicated Exosc3 genotypes on a ROSA26CreERt2/+ 
background. Mean values from three biological replicates are indicated. Error 
bars represent standard error of the mean (s.e.m.). e, Flow cytometric analysis 
of Peyer’s patch germinal centre B cells. Percentage of B220+ PNAhi germinal 
centre B cells amongst all B220+ cells is indicated. Experiment was replicated 
three times. f, SHM analysis of Peyer’s patch derived GFP+ germinal centre B 
cells at AID substrate CG base pairs at the JH4 intron. Mean values determined 
from 197 (Exosc3COIN/+) and 203 (Exosc3COIN/COIN) sequence clones are 
indicated. *P < 0.01 (t-test), **P < 0.01 (proportion test). 

Figure 2 | RNA exosome depletion reveals xTSS-RNAs. a, Differential 
expression analysis for various transcript classes. Horizontal bars indicate 95% 
confidence intervals with Bonferroni adjustment of log2 fold change between 
Exosc3COIN/COIN and Exosc3WT/WT. Circles indicate mean values. Red bars 
indicate transcript types expressed significantly higher compared with mRNA 
(two biological replicates). P values were determined by Wilcoxon rank-sum 
test. b, Strand-specific distribution of RNA-seq reads upstream and 
downstream of expressed coding gene TSSs (two biological replicates). 
c, Boxplot indicating gapped xTSS-RNA length distribution. Median values 
from two biological replicates are indicated. Whiskers represent 99% of data 
values. **P < 0.01 (Wilcoxon rank-sum test). d, Gap length distribution 
between xTSS-RNAs and coding transcript TSSs. y-Axis indicates number of 
xTSS-RNAs in each gap length group (x-axis). Compiled from two biological 
replicates. e, Percentage of TSS-RNA expressed at indicated reads per kilobase 
per million reads (RPKM). Compiled from two biological replicates. 

We evaluated CSR efficiency in ex vivo cultured B cells upon 4-OHT- 
mediated ablation of Exosc3. Immunoglobulin (Ig)G1 class switch recom- 
bination (CSR) was decreased approximately fourfold in Exosc3COIN/COIN 
B cells compared to littermate control Exosc3COIN/+ B cells (Fig. 1d and 
Extended Data Fig. 2a) despite comparable AID expression and increased 
nascent IgSγ1 transcription (Extended Data Figs 1f and 2b, c). To deter- 
mine RNA exosome involvement in somatic hypermutation (SHM), we 
generated Exosc3COIN/+ and Exosc3COIN/COIN mice expressing Cre recom- 
binase at early (Cd19Cre) and late stages (AicdaCre) of B-cell development 
(AicdaCre allele details in Extended Data Fig. 2d-f). Cd19Cre/+-mediated 
ablation of Exosc3 leads to B-cell developmental arrest preceding the 
germinal centre reaction (Extended Data Fig. 2h). However, AicdaCre/+- 
mediated deletion permits robust germinal centre B-cell production in 
Exosc3COIN/COIN mice, with a moderate increase in cell number compared 
to Exosc3COIN/+ mice (Fig. 1e). The kinetics of GFP induction and main- 
tenance between Exosc3COIN/+ AicdaCre/+ and Exosc3COIN/COIN AicdaCre/+ 
B cells ex vivo demonstrated little to no visible growth advantage between 
deleted (GFP+) and non-deleted (GFP−) cells (Extended Data Figs 3a, b). 
VPD450 dye dilution assays demonstrated comparable proliferation 
between Exosc3COIN/+ AicdaCre/+ and Exosc3COIN/COIN AicdaCre/+ B cells 
(Extended Data Fig. 3c, d). We determined the inversion efficiency of 
Exosc3COIN in sorted Exosc3COIN/COIN germinal centre B cells to be ~70%, 
compared to nearly complete inversion in Exosc3COIN/+ (Extended 
Data Fig. 1g). SHM downstream of the Igh JH4 exon was evaluated 
in Exosc3COIN/+ AicdaCre/+ and Exosc3COIN/COIN AicdaCre/+ germinal cen- 
tre B cells. Total mutation frequency was reduced in Exosc3COIN/COIN 
mice (Extended Data Fig. 2g) and exacerbated at direct AID target dC:dG 
base pairs (53% of Exosc3COIN/+; P < 0.01) (Fig. 1f). Importantly, since 
AID expression precedes Exosc3 deletion in these assays, we expect some 
SHM and CSR to occur before Exosc3 depletion, thus underrepresenting 
the complete effect of RNA exosome deletion on SHM and CSR. 

Various ncRNA species, particularly those associated with transcrip- 
tion regulation, are substrates of RNA exosome’*”°. To uncover ncRNA 
substrates of RNA exosome in B cells, we performed whole transcrip- 
tome RNA sequencing on Exosc3-deficient cells. We reconstructed the 
transcriptomes of Exosc3WT/WT (wild-type) and Exosc3COIN/COIN B cells 
and hereafter refer to the Exosc3COIN/COIN transcriptome as the ‘exo- 
tome’ (Fig. 2a). Small nucleolar RNAs (snoRNAs) and small nuclear 
RNAs (snRNAs), known targets of RNA exosome in Saccharomyces 
cerevisiae~', were upregulated in the exotome (Fig. 2a). The identity, read 
counts and coordinates of the ncRNAs analysed in Fig. 2a are provided 
in Supplementary Tables 1-3. Greatly upregulated in Exosc3-deficient B 
cells were RNA exosome substrate TSS RNAs (xTSS-RNAs) (Fig. 2a). 
Short ncRNAs arising from TSSs have been shown previously to be RNA 
exosome substrates in mammalian cells'*"*”’, although their genome- 
wide distribution and characteristics are not fully understood. xTSS- 
RNAs are expressed at regions upstream of mRNA-associated TSSs 
(Fig. 2b and Extended Data Fig. 4a) and either overlap with cognate 
mRNA TSSs (ungapped xTSS-RNA) or possess distinct TSSs (gapped 
xTSS-RNA). RNA-sequencing (RNA-seq) replicates were highly 
correlated (ρ = 0.95; Extended Data Fig. 4b). 

In Exosc3WT/WT cells, xTSS-RNA average length was ~600 bp, whereas 
in Exosc3COIN/COIN cells xTSS-RNAs were slightly longer (Fig. 2c and 
Extended Data Fig. 4c). Average TSS distance between xTSS-RNA and 
cognate mRNA was ~150 bp (Fig. 2d). Many genes in Exosc3WT/WT cells 
display low expression of xTSS-RNA (Fig. 2e). However, Exosc3 deletion 
results in a shift towards higher xTSS-RNA expression (Fig. 2e). Strand- 
specific RNA-seq experiments demonstrated that xTSS-RNA transcription 
largely occurs antisense to mRNA transcription genome-wide (Fig. 2b). 
While sense genic transcripts are comparable between Exosc3WT/WT and 
Exosc3COIN/COIN, TSS antisense transcripts in Exosc3COIN/COIN are appro- 
ximately fourfold higher (Fig. 2b). Gapped xTSS-RNA expression cor- 
related poorly with cognate mRNA expression genome-wide (ρ = 0.11; 
Extended Data Fig. 4d). Furthermore, xTSS-RNA is not uniformly 
expressed across the B-cell genome. Actively transcribed genes devoid 
of xTSS-RNA expression include β-actin, Il2rg and Ung (Extended Data 
Fig. 4e-g). Collectively, we have identified divergently transcribed anti- 
sense xTSS-RNAs expressed from a subset of transcribed genes within 
B cells. 
AID introduces mutations within Igh switch sequences during CSR. 
Upstream of the inducible CSR-specific Igg1 (also known as Ighg1) germ- 
line transcript we observed strong accumulation of xTSS-RNA in Exosc3- 
deficient B cells (Fig. 3a). Expression of Igg1 xTSS-RNA was confirmed 
through quantitative polymerase chain reaction with reverse transcription 
(qRT-PCR) and northern blotting (Extended Data Fig. 5a, b). Further- 
more, we observed robust xTSS-RNA expression at the AID target genes 
Myc, Pax5, Cd83, Pim1 and Cd79b (Fig. 3b and Extended Data Fig. 5c-f). 
xTSS-RNA transcription at these genes was largely antisense (Extended 
Data Fig. 6a). 

Recent studies using translocation capture sequencing (TC-Seq) or 
high-throughput genome-wide translocation sequencing (HTGTS) have 
identified target genes undergoing recurrent translocations to Igh due 
to AID-generated DNA breaks in B cells**. These analyses have revealed 
frequent translocations occurring near the TSSs of actively transcribed 

Figure 3 | xTSS-RNA expression marks AID-dependent translocation 
hotspots in the B-cell genome. a, Antisense Igg1 xTSS-RNA. Strand- 
specific RNA-seq reads 2 kb upstream and downstream of Igg1 germline 
transcript TSS. Compiled from two biological replicates. Chr, chromosome. 
b, Profile of RNA-seq reads at Pim1 locus (8.6 kb window). Red arrow 
indicates xTSS-RNA. Four biological replicates. c, Venn diagram of genes 
with xTSS-RNAs, genes undergoing recurrent AID-dependent chromosomal 
translocations* and somatically hypermutated genes”. xTSS-RNA group 
compiled from four biological replicates. P values determined by Fisher’s 
exact test. d, Enrichment of gapped xTSS-RNA expression amongst 
translocation hotspots in Exosc3-deficient B cells. Translocation hotspot 
gene set comprises 40 genes (identified in c) reported to undergo recurrent 
AID-mediated translocations displaying higher xTSS-RNA expression. 
‘Other xTSS-RNA genes’ set comprises 1,694 genes expressing both 
xTSS-RNA and cognate mRNA, but not reported as recurrent translocation 
hotspots. Median values from two biological replicates are indicated. 
**P < 0.01 (Wilcoxon rank-sum test). e, Myc translocation breakpoints at 
sites of xTSS-RNA and genic antisense transcription. Mouse B-cell 
translocation frequency’ and antisense transcription are shown on the 
positive and negative y-axes, respectively. Compiled from two biological 
replicates. f, Correlation between breakpoints and antisense expression 
(2 kb upstream of TSS to transcription 

genes*°. We queried these data sets to determine whether recurrent 
translocation partners of Igh express xTSS-RNAs. Our analysis revealed 
a positive statistical correlation between genes expressing xTSS-RNA 
and recurrent AID-dependent translocation (P < 0.0001; Fig. 3c and 
Extended Data Fig. 6b). Specifically, 40 genes were identified through 
this analysis (Fig. 3c and Supplementary Tables 4, 5) and, collectively, 
xTSS-RNA expression was fourfold higher in Exosc3COIN/COIN B cells 
(P < 0.01; Extended Data Fig. 6c). Even amongst all other xTSS-RNA- 
expressing genes, this group of 40 genes displayed higher xTSS-RNA 
expression (P < 0.01; Fig. 3d and Extended Data Fig. 6d). In contrast, we 
observed no difference in collective mRNA expression for these 40 genes 
between Exosc3WT/WT and Exosc3COIN/COIN B cells (Extended Data Fig. 6c). 
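
The overlap between xTSS-RNA-expressing genes and translocation hotspots is assessed in Fig. 3c with Fisher's exact test. A minimal one-sided version of that test, written as a hypergeometric tail sum, is sketched below. The counts of 40 overlapping genes, 88 hotspots and 1,734 xTSS-RNA genes (40 + 1,694) come from the text and legends; the size of the background gene universe is not given in this section, so `N_EXPRESSED_GENES` is a purely hypothetical placeholder and the resulting P value is illustrative only.

```python
from math import comb


def fisher_one_sided(overlap, n_marked, n_drawn, n_total):
    """P(X >= overlap) for hypergeometric X (one-sided Fisher's exact test).

    n_marked: genes with the property (here, xTSS-RNA expression)
    n_drawn:  genes in the tested set (here, translocation hotspots)
    n_total:  size of the background gene universe
    """
    upper = min(n_marked, n_drawn)
    tail = sum(
        comb(n_marked, k) * comb(n_total - n_marked, n_drawn - k)
        for k in range(overlap, upper + 1)
    )
    return tail / comb(n_total, n_drawn)


N_EXPRESSED_GENES = 12_000  # hypothetical background size, not from the paper

p_value = fisher_one_sided(
    overlap=40, n_marked=1_734, n_drawn=88, n_total=N_EXPRESSED_GENES
)
print(f"one-sided P = {p_value:.3g}")
```

Exact integer binomial coefficients are used throughout, so the only rounding occurs in the final division.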




Overlapping a list of genes previously shown to undergo AID-mediated 
hypermutation in mouse B cells” with these 40 xTSS-RNA expressing 
translocation hotspots revealed 5 genes, consisting of Myc, Pax5, Cd79b, 
Cd83 and Pim1 (Fig. 3c). Mutation of these genes has been observed in 
diffuse large B-cell lymphoma patients**”’. Of the 88 translocation hot- 
spots, 74 contain either TSS-RNA or antisense transcription (Extended 
Data Fig. 7a). Statistical bootstrapping analysis of 10,000 control sets of 
88 genes with similar transcription levels as the translocation hotspot 
gene set would randomly select only 15 xTSS-RNA expressing genes, thus 
validating that the observed group of 40 xTSS-RNA-containing genes 
at translocation hotspots is not solely determined by the level of cognate 
mRNA transcription (P < 0.01; Extended Data Fig. 6e). We propose 
that genes undergoing divergent transcription resulting in RNA exosome 
recruitment, as evidenced through the presence of xTSS-RNAs, 
are preferentially targeted by AID. 

Figure 4 | AID recruitment to RNA-DNA hybrid-forming divergently 
transcribed genes. a, Exosc3 promotes AID targeting to divergent promoters. 
ChIP was performed using anti-AID or control IgG. qPCR was performed 
using TSS upstream-specific primers. PCR amplification for the non-divergent 
genes Cmas, March2, Atp13a2 and Fam107b was below the detection limit in 
anti-AID ChIP. Three pairs of mice of each genotype were used. Mean values 
from three technical replicates are indicated. Error bars represent s.d. 
*P < 0.05, **P < 0.01 (log2 transformation of Z-test). b, H3K4me3 is unaltered 
in Exosc3-deficient B cells. ChIP was performed using anti-H3K4me3 
(Millipore) or control IgG. qPCR and data analysis are as described in a. Two 
pairs of mice of each genotype were used. NS, not significant (t-test). 
c, d, H3S10ph accumulation at divergent promoters. ChIP was performed using 
anti-H3S10ph (Millipore) or control IgG. qPCR was performed as described in 
a. H3S10ph enrichment relative to input in Exosc3WT/WT (c) or fold change of 
Exosc3COIN/COIN is indicated (d). Three pairs of mice of each genotype were 
used. Mean values from three technical replicates are indicated. c, **P < 0.01 
(t-test) for H3S10ph ChIP between divergent and non-divergent promoter sets. 
d, *P < 0.05, **P < 0.01 (log2 transformation of Z-test). e, RNA-DNA hybrid 
accumulation at divergently transcribed genes in Exosc3-deficient B cells. qPCR 
was performed as described in a. Two pairs of mice of each genotype were used. 
Mean values from three technical replicates are shown. Error bars represent s.d. 
**P < 0.01 (t-test) for Exosc3COIN/COIN DNA:RNA immunoprecipitation 
(DRIP) between divergent and non-divergent promoter sets. f, g, Deletion of 
xTSS-RNA-expressing region impairs AID targeting to divergently transcribed 
genes. Anti-AID ChIP was performed on parental or clonal CH12F3 B-cell 
lymphoma lines containing CRISPR-mediated deletions of Cd83 (f) or Pim1 
(g) xTSS-RNA-expressing regions. qPCR was performed using exon 1 specific 
primers. Mean values from three technical replicates are indicated. Error bars 
represent s.d. *P < 0.05, **P < 0.01 (t-test). Indicated genotypes (except 
f, g) are on a ROSA26CreERT2/+ background and B cells were treated with 4-OHT 
and stimulated with LPS plus IL-4. 

Many translocations also occur within gene bodies and cannot be 
readily explained by TSS-proximal transcription. However, RNA exo- 
some can also regulate the expression of antisense RNA (asRNA) ini- 
tiating within gene bodies. Strikingly, a considerable number of Myc 
translocation junctions precisely map to a region within intron 1 that 
expresses RNA exosome substrate asRNA (Fig. 3e; additional examples 
in Extended Data Fig. 7b and Supplementary Fig. 1). Many of the 48 
translocation hotspot genes that do not express xTSS-RNAs (Fig. 3c) do 
express RNA exosome substrate asRNA (Supplementary Fig. 1). Moreover, 
breakpoints within these translocation hotspots strongly correlate 
with the presence of RNA exosome substrate asRNA (ρ = 0.79; Fig. 3f). 
Additionally, Cd83 translocation breakpoints mapping to regulatory 
regions contain RNA exosome substrate asRNA (Extended Data Fig. 7c). 
When analysing all asRNAs in Exosc3-deficient B cells (Extended Data 
Fig. 8a), we note a strong positive correlation between asRNA expression 
and the probability of observing translocation breakpoints (ρ = 0.89; 
Fig. 3g), whereas sense transcription correlated poorly (ρ = 0.38; Fig. 3h). 
Altogether, we provide evidence that AID target sites in the B-cell genome 
possess RNA exosome substrate asRNA, both TSS-proximal and within 
gene bodies. 

An outstanding question concerns the molecular mechanisms relating 
AID targeting to genes expressing xTSS-RNAs or asRNAs. Divergent promoters, 
including those expressing xTSS-RNAs, occupy two pre-initiation 
complexes positioned divergently and separated by ~150 bp. Divergent 
transcription enhances localized DNA melting surrounding the two TSSs 
as polymerases initiate transcription, thus generating ssDNA structures. 
Antisense transcripts from such promoters undergo early termination, 
leading to RNA exosome recruitment. When stalled antisense transcripts 
are not efficiently removed, stabilization of transcription-coupled 
R-loops can prolong ssDNA structures. AID requires ssDNA substrates 
for recruitment and subsequent DNA deamination, which is further 
enhanced by pausing and/or stalling of RNA Pol II. Using chromatin 
immunoprecipitation (ChIP) assays, we find that Exosc3 promotes AID 
occupancy at target genes Pim1, Pax5, Myc, Cd79b and Cd83 (Fig. 4a). 
This observation cannot be explained simply through differences in gene 
expression, as H3K4me3 abundance is similar between Exosc3WT/WT 
and Exosc3COIN/COIN B cells, indicating comparable transcription initiation 
(Fig. 4b). Consistent with these observations, xTSS-RNA expression 
is enriched at AID- and Spt5-occupied genes in B cells (Extended 
Data Fig. 8b). Similarly, xTSS-RNA is also enriched at AID target genes 
identified by replication protein A sequencing (RPA-seq) (Extended 
Data Fig. 8c), a marker of ssDNA. H3S10ph is a chromatin mark 
associated with ssDNA-containing R-loop structures. We observe that 
AID-targeted divergent promoters accumulate H3S10ph and Exosc3, 
unlike robustly transcribed non-divergent promoters (Fig. 4c and Extended 
Data Fig. 9a-e). H3S10ph accumulation is further enhanced at 
divergent promoters Pim1, Pax5, Cd79b and Cd83 upon loss of Exosc3 
(Fig. 4d). ssDNA accumulation was also evaluated using the RNA-DNA 
hybrid-specific S9.6 antibody. We find that AID-targeted divergent 
promoters (Pim1, Cd79b, Cd83) accumulate RNA-DNA hybrids more 
strongly than transcribed non-divergent promoters (March2, Cmas, 
Atp13a2) (Fig. 4e). Finally, deletion of xTSS-RNA regions corresponding 
to Pim1 or Cd83 (Extended Data Fig. 9f-i) impairs AID recruitment to 
these genes (Fig. 4f, g) and reduces AID-mediated hypermutation within 
the first kilobase pair of Cd83 (Extended Data Fig. 9j). 

On the basis of our findings, we propose that divergent TSSs gen- 
erating RNA exosome substrates have a key role in recruiting AID and 
generating ssDNA structures across the B-cell genome (Extended Data 
Fig. 10). In conjunction with ssDNA formation, increased RNA Pol II 
stalling at divergent promoters may further facilitate AID recruitment. 
Similarly, asRNAs generated within gene bodies can also create ssDNA 
structures, serving as AID substrates and leading to chromosomal trans- 
location (summarized in Extended Data Fig. 10). Our study provides 
evidence that in addition to RNA Pol II stalling and ssDNA generation, 
antisense transcription is important for AID targeting throughout the 
B-cell genome. 
METHODS 


Generation of the Exosc3COIN allele. Bacterial homologous recombination 
methodologies were used to modify a bacterial artificial chromosome (BAC) containing 
the mouse Exosc3 locus (clone bMQ386a13). Two sequential BAC modifications 
were performed. First, lox2372 and loxP sites were inserted between Exosc3 
exons 1 and 2. Subsequently, the COIN module (antisense to Exosc3), a second set 
of lox2372 and loxP sites, and an FRT-flanked neo selection cassette were inserted 
between Exosc3 exons 3 and 4. The COIN module comprises a 3' splice acceptor 
sequence, followed by a T2A-GFP open reading frame, followed by a polyadenylation 
sequence. Correctly modified Exosc3COINneo BAC clones were identified by PCR 
screening across all four recombination junctions and verified by restriction digestion 
followed by pulsed-field gel electrophoresis. The entire 6.9 kb region between the upstream 
and downstream homology arms of the Exosc3COINneo BAC clone used for embryonic 
stem (ES)-cell targeting was confirmed by sequencing. The BAC-based Exosc3COINneo 
targeting vector was electroporated into ROSA26CreERT2/+, 129S6/SvEvTac × C57BL/6Tac 
hybrid ES cells and correctly targeted clones were identified by a loss-of-allele 
assay. Exosc3COINneo/+ chimaeric mice were generated by ES-cell microinjection of 
blastocysts and crossed with Tg(ACTB:FLPe) mice to excise the neo cassette and produce 
germline transmission of the Exosc3COIN allele. The FLPe transgene was subsequently 
bred out for the production of all Exosc3COIN experimental cohorts. All mouse 
experiments were performed in accordance with approved Columbia University 
Institutional Animal Care and Use Committee protocols. 

Cell culture and CSR. Splenic B cells from sex-matched littermate mice were pre- 
pared using CD43 microbead (Miltenyi Biotec) negative selection and cultured in 
RPMI 1640 containing 15% FBS. Ex vivo CSR cultures using the ROSA26CreERT2 allele 
were cultured for 16 h with 100 nM 4-hydroxytamoxifen (Sigma) and 20 µg ml−1 LPS 
(Sigma) followed by the addition of IL-4 (R&D Systems) at 20 ng ml−1, and cultured 
for an additional 72 h. CSR culture conditions using the Aicdacre allele were identical 
with the exception that 4-hydroxytamoxifen was not used. Cells were stained using 
fluorescent antibodies against B220 and IgG1 (BD Biosciences). Data were acquired 
on a FACSAria cell sorter (BD Biosciences) and analysed using FlowJo software (Tree 
Star). Antibodies used for western blot purposes: Exosc3 (Santa Cruz Biotechnology), 
actin (Abcam) and goat anti-rabbit IgG HRP (Jackson ImmunoResearch). All experi- 
ments involving mice were performed with known genotypes and therefore per- 
formed unblinded. Biological replicates involve B cells isolated from separate mice. 

RNA preparation and qPCR. Total RNA was isolated from cells using Trizol reagent 
(Life Technologies). RNA was resuspended in water and quantified using a 
Nanovue Plus spectrophotometer (GE Healthcare Life Sciences). RNA samples were 
treated with DNase I (Turbo DNA-free kit, Life Technologies), eluted in water and 
re-quantified. 1.5 µg of RNA was then converted to cDNA using random hexamers 
and the Superscript III First-Strand Synthesis System for RT-PCR (Life Technologies). 
Exosc3 and Aicda mRNA measurements were performed using the TaqMan Gene 
Expression Assay (Applied Biosystems). TaqMan assay ID numbers for Exosc3 and 
Aicda were Mm01345308_m1 and Mm00507774_m1, respectively. All other quant- 
itative RT-PCR experiments were performed using SYBR Green Master Mix (Roche 
Applied Science). Primer pairs for the quantification of associated xTSS-RNAs were 
as follows: Pim1, 5’-CACATGCACGTGGAAATACCA-3' and 5'-CATCCATAA 
AAGTTATGGAGTC-3’; Pax5, 5'-CTGCTTTTTCAGGTCTAGCTC-3’ and 5’-CC 
CATTCAAAAGCTCATTAAG-3'; Il4ra, 5'-GGCTGTTGCTCATTTTCCCAA-3’ 
and 5’-TGTGGGCGAGAGAACAACTTC-3’; Myc, 5'’-AGCGCAGCATGAATTA 
ACTGC-3' and 5'-GTATACGTGGCAGTGAGTTG-3’; Igg1, 5'-GTATCTTGITG 
GTGCTATCTCA-3’ and 5’-TGGGATCTGCTACACAGGTTT-3’. Expression levels 
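for individual transcripts were normalized against β-actin and/or cyclophilin with 
similar results. Fold change in transcript levels was calculated as 

$$\text{fold change} = \frac{2^{\left(\mathrm{Ct}_{\mathrm{WT,GOI}} - \mathrm{Ct}_{\mathrm{COIN/COIN,GOI}}\right)}}{2^{\left(\mathrm{Ct}_{\mathrm{WT,actin}} - \mathrm{Ct}_{\mathrm{COIN/COIN,actin}}\right)}}$$

where Ct is the cycle threshold and GOI is the gene of interest. 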
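
The fold-change calculation above can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' analysis script: the function name and the Ct values are hypothetical, and β-actin stands in for whichever reference gene was used.

```python
def fold_change(ct_wt_goi, ct_coin_goi, ct_wt_ref, ct_coin_ref):
    """Fold change of a gene of interest (GOI) in COIN/COIN relative to WT cells,
    normalized to a reference gene such as beta-actin.

    Implements 2^(Ct_WT,GOI - Ct_COIN,GOI) / 2^(Ct_WT,ref - Ct_COIN,ref).
    """
    goi_term = 2 ** (ct_wt_goi - ct_coin_goi)
    ref_term = 2 ** (ct_wt_ref - ct_coin_ref)
    return goi_term / ref_term


# Hypothetical Ct values: the GOI crosses threshold two cycles earlier in
# COIN/COIN cells while the reference gene is unchanged.
print(fold_change(28.0, 26.0, 18.0, 18.0))  # 4.0
```

A lower Ct means more template, so a transcript that crosses threshold two cycles earlier in the mutant, with an unchanged reference gene, corresponds to a fourfold increase.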

Northern blotting. Total RNA (7.5 µg) isolated from splenic B cells was electrophoresed 
in denaturing conditions on a 1% agarose-formaldehyde gel at 5.5 V cm−1. 
The gel was rinsed several times in deionized water followed by 20× SSC and then 
transferred overnight by capillary transfer onto an Amersham Hybond-XL mem- 
brane. The RNA was crosslinked to the membrane using a Stratagene UV crosslinker 
and then stained with methylene blue in 0.3 M sodium acetate to ascertain transfer 
efficiency. Subsequently the membrane was prehybridized for 5 h before being hybri- 
dized with a cDNA encoding the gene of interest (approximate size of probe: 400 bp), 
radiolabelled through the random primed labelling technique (High Prime, Roche 
Applied Science) and purified twice over Probequant G50 microcolumns (GE Healthcare) 
before addition to the hybridization reaction. The membrane was hybridized overnight 
at 42 °C in hybridization solution before being washed twice in 0.1% SDS/2× SSC 
at room temperature followed by two washes in 0.1% SDS/2× SSC at 65 °C, all washes 
for 15 min. Membranes were subsequently exposed to film for varying amounts of time. 

RNA-seq analysis. rRNA-depleted total RNA was prepared using the Ribo-Zero 
rRNA removal kit (Epicentre). Libraries were prepared with Illumina TruSeq and 
TruSeq Stranded total RNA sample prep kits, and then sequenced with 50-60 million 
2 × 100 bp paired raw passing filter reads on an Illumina HiSeq 2000 V3 instrument 
at the Columbia Genome Center. To construct the transcriptome of Exosc3WT/WT and 
Exosc3COIN/COIN B cells, we first mapped all reads of total RNAs to the mouse reference 
genome (mm9) with TopHat (v. 1.3.2). Cufflinks (v. 1.2.1) was subsequently 
applied to assemble the whole transcriptome and to identify all possible transcripts. 
To obtain short RNAs (potential exosome targets), we set the overlap radius as 1 and 
merged repeated samples. All resulting transcripts are further annotated under the 
supervision of a comprehensive collection of RNA databases including mRNA, 
snoRNA, long intergenic noncoding RNA (lincRNA), microRNA and other anno- 
tated non-coding RNAs (Supplementary Table 1). Specifically, we overlapped all 
assembled transcripts with known RNAs in the collected database, and annotated 
the transcripts by their adjacent known RNAs. A transcript is annotated as one 
isoform of a known RNA if both the TSS and transcription end site (TES) of the 
transcript are close (less than 200 bp) to those of the known RNA. If, compared with 
a known RNA, the TSS of a transcript is shifted more than 200 bp, this transcript 
is annotated as the mixture of an upstream transcript (ungapped xTSS-RNA) and 
a known RNA. Similarly, if the TES is shifted more than 200 bp, the corresponding 
transcript is annotated as a read through. Transcripts that have no overlap with any 
known RNA located within 2,000 bp upstream of a known RNA are annotated as 
gapped upstream transcripts (gapped xTSS-RNA). Conjoint transcripts were anno- 
tated on the basis of containing sequences from multiple known RNAs. Biological 
replicates indicate that each RNA-seq data set was generated from B cells isolated 
from separate mice. B-cell translocation hotspots were defined based on the supplementary 
table of Klein et al. AID and Spt5 binding loci are based on ChIP-seq data from 
Pavri et al. 
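
The annotation rules above can be illustrated with a small classifier. This is a simplified sketch, not the pipeline used in the study: it assumes plus-strand coordinates only, ignores strand and splicing, and reads the TSS rule as a shift of more than 200 bp upstream so that it parallels the TES rule. The `Interval` type, the function names and the category labels are hypothetical.

```python
from dataclasses import dataclass

TOL = 200         # bp tolerance for calling a TSS or TES "close" (from the text)
UPSTREAM = 2000   # bp window for gapped upstream transcripts (from the text)


@dataclass
class Interval:
    start: int  # TSS of a plus-strand transcript
    end: int    # TES of a plus-strand transcript


def overlaps(a, b):
    """True if the two intervals share at least one base."""
    return a.start < b.end and b.start < a.end


def annotate(tx, known):
    """Classify an assembled transcript against a list of known RNAs."""
    hits = [k for k in known if overlaps(tx, k)]
    if len(hits) > 1:
        return "conjoint"
    if len(hits) == 1:
        k = hits[0]
        tss_shift = k.start - tx.start  # > 0: transcript starts upstream
        tes_shift = tx.end - k.end      # > 0: transcript ends downstream
        if abs(tss_shift) < TOL and abs(tes_shift) < TOL:
            return "isoform"
        if tss_shift >= TOL:
            return "ungapped xTSS-RNA + known RNA"
        if tes_shift >= TOL:
            return "read-through"
        return "other"
    # No overlap: look for a known RNA starting within 2,000 bp downstream.
    if any(0 <= k.start - tx.end <= UPSTREAM for k in known):
        return "gapped xTSS-RNA"
    return "unannotated"


known = [Interval(10_000, 15_000)]
print(annotate(Interval(10_050, 14_950), known))  # isoform
print(annotate(Interval(8_500, 9_500), known))    # gapped xTSS-RNA
```

A real implementation would also need strand-aware coordinates, because xTSS-RNAs at divergent promoters are transcribed antisense to the coding gene.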

Statistical analysis. To test the significance of the difference of two vectors, a two- 
sided nonparametric Wilcoxon rank-sum test was applied to calculate the P values. 
To test the difference between two proportions, the following equation was used: 

$$z = \frac{p_1 - p_2}{\sqrt{P(1-P)\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}}$$

where $p_1$ and $p_2$ are the two proportions, P is the expected value, and $n_1$ and $n_2$ are the 
population sizes. P values were then generated from the normal distribution. We applied the 
pipeline of DESeq for the normalization of library size and gene differential expression 
analysis. Multiple comparison analysis was performed in MATLAB R2011a. 
One-way analysis of variance (ANOVA) was followed by a multiple comparison procedure 
with critical values from the t distribution, after a Bonferroni adjustment. 
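
As an illustration of the proportion test, the sketch below computes z and a P value from the normal distribution. Two points are assumptions rather than statements from the text: P is taken to be the pooled proportion of the two samples, and the P value is two-sided. The counts in the example are hypothetical.

```python
import math


def two_proportion_z_test(x1, n1, x2, n2):
    """z statistic and two-sided P value for the difference of two proportions.

    x1, x2: event counts (for example, mutated bases); n1, n2: sample sizes.
    The expected value P is taken as the pooled proportion (an assumption).
    """
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    # Two-sided tail area of the standard normal: 2 * (1 - Phi(|z|)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: 30 mutations in 10,000 bases versus 10 in 10,000.
z, p = two_proportion_z_test(30, 10_000, 10, 10_000)
print(round(z, 2), p)
```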

SHM. Peyer's patches were excised from 2.5-3.5-month-old paired littermates and 
gently dissociated by passage through a 70 µm cell strainer. Germinal centre B cells 
were stained with anti-B220 (BD Biosciences) and peanut agglutinin (Vector Labo- 
ratories). DAPI stain was added just before cell sorting to exclude dead cells. Cells 
were directly sorted into lysis buffer containing proteinase K (Viagen) using a 
FACSAria cell sorter (BD Biosciences) and incubated at 55 °C overnight. Fifteen 
micrograms of GlycoBlue (Life Technologies) were added to the lysates and genomic 
DNA was purified via ethanol precipitation. JH4 intron was amplified by PCR using 
LA Taq (Takara) and a primer pair (J558FR3Fw: 5'-GCCTGACATCTGAGGACTC 
TGC-3' and JH4intronRv: 5'-CCTCTCCAGTTTCGGCTGAATCC-3’) that requires 
a VDJ rearrangement of the Igh locus for amplification to occur. JH4 amplicons 
were cloned into the pCR2.1-TOPO vector (Life Technologies) and sequenced using 
M13 primers. Mutation analysis at Cd83 in CH12F3 cells was performed on a 488 bp 
sequence between PCR primers GCCTCCAGCTCCTGTTTCTA and TGTTGCTT 
TCACTGCAGCTCTC. 

Analysis of TC-Seq. To capture translocation breakpoints of B cells genome-wide, 
we downloaded the TC-Seq data sets from the Sequence Read Archive 
(http://www.ncbi.nlm.nih.gov/sra) under accession number SRA039959, and followed a 
computational workflow similar to that described by Oliveira et al. All alignments were 
performed with BWA using default parameters, and the counting of reads was performed 
in bedtools. 

Translocation breakpoint probability modelling. To predict the probability of 
observing breakpoints in a given genomic region, we assume the number of TC-Seq 
reads in this region follows a negative binomial model with parameters r and p. We 
applied a maximal likelihood method to estimate the parameters, as well as the con- 
fidence interval of the parameters. For a given genomic region, the probability of 
occurrence of translocation breakpoints is then defined as the probability of har- 
bouring more than x TC-Seq reads. In this manuscript, two types of regions are sep- 
arately considered. In one type of region, each base pair from 2 kb upstream of the 
TSS to the TES of translocation hotspot genes is binned into ~10 tiers according to 
their expression level of antisense transcripts. Approximately 4 x 10° genomic posi- 
tions were binned according to antisense RNA expression level. Negative binomial 
distribution fitted in each region to estimate breakpoint probability. In the other 
type, genomic regions of all collected genes that do not harbour known antisense 
RNAs (28,947 genes) are binned into 46 tiers according to the expression level of 
their antisense transcripts to estimate the parameters of the negative binomial model. 
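
The modelling step above can be sketched with the standard library alone. This is an illustrative implementation under stated assumptions, not the authors' code: the read counts are hypothetical, the maximum-likelihood fit uses a golden-section search over log(r) with p set to its closed-form estimate for each r, and confidence intervals for the parameters are omitted.

```python
import math


def nb_logpmf(k, r, p):
    """log P(X = k) for a negative binomial with size r > 0 and probability p."""
    return (math.lgamma(k + r) - math.lgamma(k + 1) - math.lgamma(r)
            + r * math.log(p) + k * math.log1p(-p))


def profile_loglik(counts, r):
    """Log-likelihood at size r, with p at its closed-form MLE r / (r + mean)."""
    mean = sum(counts) / len(counts)
    p = r / (r + mean)
    return sum(nb_logpmf(k, r, p) for k in counts), p


def fit_nb(counts, r_lo=1e-3, r_hi=1e3, iters=100):
    """Estimate (r, p) by golden-section search on log(r).

    Assumes the profile likelihood has a single maximum on [r_lo, r_hi].
    """
    if sum(counts) == 0:
        raise ValueError("need at least one non-zero count")
    ratio = (math.sqrt(5) - 1) / 2
    a, b = math.log(r_lo), math.log(r_hi)

    def f(u):
        return profile_loglik(counts, math.exp(u))[0]

    for _ in range(iters):
        c, d = b - ratio * (b - a), a + ratio * (b - a)
        if f(c) > f(d):
            b = d  # maximum lies in [a, d]
        else:
            a = c  # maximum lies in [c, b]
    r = math.exp((a + b) / 2)
    return r, profile_loglik(counts, r)[1]


def prob_more_than(x, r, p):
    """P(X > x): probability that a region harbours more than x TC-Seq reads."""
    cdf = sum(math.exp(nb_logpmf(k, r, p)) for k in range(x + 1))
    return max(0.0, 1.0 - cdf)


# Hypothetical TC-Seq read counts for the positions in one expression tier.
counts = [0, 0, 1, 0, 3, 0, 7, 2, 0, 12, 1, 0]
r_hat, p_hat = fit_nb(counts)
print(r_hat, p_hat, prob_more_than(5, r_hat, p_hat))
```

Fitting each expression tier separately and plotting `prob_more_than` against the tier's antisense expression level gives the kind of relationship that the correlation analysis summarizes.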

ChIP. Crosslinking was performed on cultured cells using formaldehyde. Sonication 
was performed on ice using a Branson Sonifier 250 apparatus for 25 cycles, 
each cycle comprising 20 s of sonication at duty cycle 30% followed by a 2 min rest 
period. Lysates were pre-cleared for 1 h. Immunoprecipitation of lysates was performed 
overnight at 4 °C using indicated antibodies. Sepharose A/G beads were 
added for 90 min with continued rotation. Subsequently the beads were washed by 
the standard series of washes (low salt, high salt, LiCl, and TE) and ChIP products 
were eluted followed by RNase A treatment overnight at 60 °C and proteinase K 
treatment for 2 h at 55 °C. ChIP DNA was recovered using ethanol precipitation. 
Primers used for ChIP quantitative PCR were as follows. Myc, CGGTTGATCACC 
CTCTATCACTC and GCTCCACACAATACGCCATGTAG; Pim1, CCCAGGAT 
CTAGCCCACATAACATC and AGCGTAGCAAGTTGTGAGAAATGG; Pax5, C 
TGCTAGGATGGTTCTGCTTGG and CAACTCAATTGCAACCTCCATAGGTC; 
Cd79b, TGCTGATTGAGAAGGTTGGTGTG and GGAAGGGGTTGCTCCTGAA 
TC; Cd83, AGATCTCCCTTGCTCAAACAACG and GACCTGCTACTCTCCAGA 
TITTGTG; Cmas, GGAAAACGGAAAGAGGCTGGAG and TGAGCTCAGAGG 
AGCCTCTAG; Atp13a2, CAGCCTGTCCTTTTCCGTCTATC and AGCTCGCTG 
AGATCTTGATGC; March2, GCAGCAAGTCTACAGCCAGAG and GCCTCTGA 
GTATCATCTGCCAATC; Fam107b, GACACCTTCCATTAGACAGGTGAC and 
AGATGAGAGCTCTGGATCCTTGG. Technical replicates of ChIP were performed 
from the same cell type. 

DRIP. DRIP was performed on cultured B cells following a previously described 
protocol. Briefly, cells were spun down and digested overnight in TE with proteinase 
K, followed by phenol-chloroform extraction and ethanol precipitation. Genomic 
DNA was digested using BsoBI, NheI, NcoI and StuI either with or without 
RNase H (NEB). Purified DNA was then immunoprecipitated using S9.6 antibody 
(gift from F. Chédin). Technical replicates of DRIP were performed from the same 
cell type. 

CRISPR/Cas9-mediated targeted deletions. Guide RNA sequences were designed 
using an online tool (http://tools.genome-engineering.org). Guide RNA-encoding 
oligonucleotides (Cd83, AGTGCCCAACACTACCTAAT and TTCCGAAGCCTC 
AGGGCGCG; Pim1, ATCAGACACATTCCGAGAAG and CTCTGTGTTTCCCG 
GAGATT) were cloned into the BbsI site of pSpCas9(BB)-2A-GFP (pX458, Addgene) 
as described. Guide RNA/Cas9 expression vectors were electroporated into CH12F3 
cells using Amaxa Nucleofector (Lonza). Cells were cloned using limiting dilution 
3 days after electroporation. Individual clones were screened for homozygous deletion 
of xTSS-RNA-encoding regions using PCR. Screening primers for Cd83 xTSS-RNA 
deletion were CCATGCTACAATGCACAGACCTAC and CAGCCTAGAAACA 
GGAGCTGGAG. Screening primers for Pim1 xTSS-RNA deletion were CCAGGG 
ATCAAACCTAGGATTTTC and CAGAAGACGCCCTATTTGCATAAGG. AID 
ChIP primers were as follows: Pim1, CTCGCTCCGCCGCCGCTGCTG and CGCA 
GGTGGGCCAGGGAGTTGAT; Cd83, GCCTCCAGCTCCTGTTTCTA and TCG 
GAGCAAGCCACCGTCAC. 
Extended Data Figure 1 | Exosc3 gene targeting and functional validation of 
the Exosc3COIN allele. a, Schematic of the Exosc3COINneo BAC targeting vector. 
Blue shaded boxes indicate Exosc3 exons 1-4. Lox sites are represented by 
triangles. The GFP-expressing gene trapping module is represented by green 
arrows. Upstream, downstream and internal homology arms are 41.3, 81.5 and 
2.4 kb, respectively. b, Confirmation of Exosc3COINneo/+ targeted embryonic 
stem (ES)-cell clone AF8. The loss-of-allele (LOA) assay was used to screen 
ES-cell clones for wild-type allele copy number at defined locations within 
Exosc3 introns 1 and 3 that have been modified to allow for distinction between 
wild-type and COINneo alleles by TaqMan-based qPCR. A probe for a non-targeted 
locus, Wnt2b, served as an internal qPCR standard for both copy 
number and total input DNA. Data represent mean values from six technical 
replicates. Error bars represent s.d. Ct, cycle threshold. c, HindIII restriction 
map of the wild-type (WT), COIN and COINinv alleles of Exosc3. The black 
shaded box indicates the location of the probe used for Southern blotting in 
Fig. 1b. d, Flow cytometric analysis of GFP expression in naive or 4-OHT- 
treated, LPS plus IL-4-stimulated B-cell cultures. Indicated Exosc3 genotypes 
are on a ROSA26CreERT2/+ background. One pair of littermate mice was used. 
Three biological replicates were performed. e, Profile of RNA-seq mapped 
reads at the Exosc3 locus from 4-OHT-treated, LPS plus IL-4-stimulated B-cell 
cultures. Indicated Exosc3 genotypes are on a ROSA26CreERT2/+ background. 
Four biological replicates were performed. f, Immunoblot analysis of Exosc3 
and AID protein expression in whole cell extracts from 4-OHT-treated, 
LPS plus IL-4 stimulated B-cell cultures. Actin was used as a loading 
control. Wild type, Exosc3WT/WT ROSA26CreERT2/+; C/C, Exosc3COIN/COIN 
ROSA26CreERT2/+. One pair of littermate mice was used. Two technical 
replicates were performed. g, Exosc3°?"’: Exosc30"™” ratio in germinal centre 
B cells determined by qPCR copy number analysis (three technical replicates, 
error bars represent s.d.). 
Extended Data Figure 2 | Exosc3-deficient B cells are impaired in CSR and 
SHM. a, Representative flow cytometric analysis for surface IgG1 on purified B 
cells treated with 4-OHT, and stimulated with LPS plus IL-4. Numbers indicate 
the percentage of GFP+ B220+ B cells having isotype switched to IgG1. One 
pair of littermate mice was used. Three biological replicates were performed. 
b, Quantitative RT-PCR time-course analysis of Aicda mRNA expression in 
naive (day 0) or 4-OHT-treated (days 2-4), LPS plus IL-4 stimulated B-cell 
cultures. Indicated Exosc3 genotypes are on a ROSA26CreERT2/+ background. 
Expression levels are normalized to cyclophilin (Ppia) and plotted relative to 
naive Exosc3COIN/+. Six littermate pairs of each genotype were used. Data 
represent mean values from three technical replicates. Error bars represent s.d. 
**P < 0.01 (t-test). c, Quantitative RT-PCR analysis of Ighg1 switch region 
intron expression. Primers were designed to amplify a region of the Ighg1 GLT 
intron upstream of the Ighg1 switch region core repeat, but downstream of the 
Ighg1 non-coding I exon. Two independent pairs of littermate mice of each 
genotype were used to obtain total RNA from B-cell cultures treated with 
4-OHT and stimulated with LPS plus IL-4. Indicated genotypes are on a 
ROSA26CreERT2/+ background. Data represent mean values from three technical 
replicates. Error bars represent s.d. Two biological replicates were performed. 
d, Schematic of the targeted Aicdacre allele. An open reading frame comprising 
a nuclear localization signal fused to Cre recombinase was used to disrupt the 
ATG start codon in exon 1 of Aicda. Exons are represented as numbered boxes. 
e, Specific induction of Aicdacre activity upon LPS plus IL-4 stimulation of B 
cells. Flow cytometric analysis of Aicdacre activity (as determined by GFP 
expression) in B220+ and B220− naive splenocyte populations (top panel). 
Aicdacre induction in LPS plus IL-4 stimulated B-cell cultures (bottom panel). 
One pair of littermate mice was used. f, Aicdacre is a functional null allele. CSR 
to IgG1 isotype is abrogated in Aicdacre/cre homozygous B cells stimulated with 
LPS plus IL-4. Numbers above gate indicate the percentage of GFP+ B cells 
having isotype switched to IgG1. One pair of littermate mice was used. g, SHM 
analysis of Peyer's patch-derived GFP+ germinal centre B cells. Mutation 
frequencies were determined by sequencing a 645 bp intronic region 
downstream of the JH4 gene segment of the immunoglobulin heavy chain 
(IgH) locus. Two littermate pairs of each genotype were used. Two biological 
replicates were performed. Mutation frequencies represent mean values. P 
values were determined by proportion test. h, Flow cytometric analysis of 
Peyer's patch-derived germinal centre B cells from Exosc3COIN/+ and 
Exosc3COIN/COIN mice on a Cd19cre/+ background. Germinal centre B cells 
were identified as B220+ PNA+ populations. The percentage of germinal centre 
B cells amongst all B220+ cells is indicated. One pair of littermate mice was 
used. Three biological replicates were performed. 
Extended Data Figure 3 | Proliferation analysis of Exosc3-deficient B cells. 
a, FACS analysis indicating the percentage of GFP-negative (left gate) and GFP-positive 
(right gate) B cells 3.5 days after LPS stimulation. One pair of littermate 
mice was used. Two biological replicates were performed. b, Kinetic analysis of 
GFP-positive B-cell accumulation at indicated time points post-LPS 
stimulation. Indicated Exosc3 genotypes are on an Aicdacre/+ background. One 
pair of littermate mice was used. Two biological replicates were performed. 
c, Proliferation analysis determined by VPD450 dilution at 1.5 and 2.5 days 
post-LPS stimulation. One pair of littermate mice was used. d, Quantitative 
RT-PCR analysis of Exosc3, Aicda and GFP mRNA expression in GFP+ cells at 
3.5 days post-LPS stimulation. Expression levels are normalized to β-actin and 
plotted relative to Exosc3COIN/+. One pair of littermate mice was used. Data 
represent mean values from three technical replicates. Error bars represent s.d. 
Extended Data Figure 4 | Transcriptome analysis of Exosc3-deficient B 
cells. a, Genome-wide expression level analysis upstream and downstream of 
TSS region for expressed protein coding genes. Coding genes with FPKM >1 
were determined to be expressed. Analysis was restricted to coding genes that 
do not have any known genes within a 4 kb upstream boundary. Indicated 
genotypes are on a ROSA26CreERT2/+ background. One sex-matched littermate 
pair was used. Two biological replicates were performed. b, Replicate analysis of 
genome-wide studies. Plots indicate the expression levels of individual genes in 
Exosc3WT/WT and Exosc3COIN/COIN B cells treated with 4-OHT and stimulated 
with LPS plus IL-4 from two separate littermate pairs. B cells were purified, 
cultured and FACS sorted, and RNA was purified and sequenced by RNA-seq 
all independently between the two experiments. Indicated genotypes are on a 
ROSA26CreERT2/+ background. Pearson correlation is indicated. c, The 
distribution of observed lengths for all gapped xTSS-RNAs in Exosc3-deficient 
B cells. Data were compiled from two biological replicates. d, Scatter plot 
indicating weak correlation between expression of downstream coding 
transcript and upstream gapped xTSS-RNA at divergently transcribed loci in 
Exosc3-deficient B cells. Pearson correlation is indicated. e-g, Profile of RNA- 
seq mapped reads at the β-actin locus (e) (Actb; 7.6 kb window), Il2rg locus 
(f) (5.4 kb window) and Ung locus (g) (12 kb window). Indicated genotypes are 
on a ROSA26CreERT2/+ background and B-cell cultures were treated with 4-OHT 
and stimulated with LPS plus IL-4. Four biological replicates were performed. 




Extended Data Figure 5 | xTSS-RNA expression at AID target genes. 
a, Quantification of xTSS-RNA expression levels of AID target genes via 
quantitative RT-PCR from two independent experiments. Indicated genotypes 
are on a ROSA26CreERT2/+ background. b, Northern blot analysis of xTSS-RNA 
expression at Myc and Igg1 loci. WT, Exosc3WT/WT ROSA26CreERT2/+. C/C, 
Exosc3COIN/COIN ROSA26CreERT2/+. c-e, Profiles of RNA-seq mapped reads at 
the Pax5 (c) (72 kb window displaying exons 1-6), Cd83 (d) (22 kb window), 
and Myc (e) (9 kb window) loci. Red arrows highlight the presence of xTSS-RNA. 
Four biological replicates were performed. f, Quantification of xTSS-RNA 
expression levels for AID target genes Myc, Igg1, Cd83, Pim1, Pax5 and 
Cd79b was obtained from RNA-seq RPKM values from two independent 
experiments. Indicated genotypes are on a ROSA26CreERT2/+ background. 
**P < 0.01 (t-test). 
Extended Data Figure 6 | Translocation hotspots are enriched for xTSS-RNA 
expression. a, Strand-specific RNA-seq mapped reads at AID target 
genes Myc, Cd83, Pim1, Pax5 and Cd79b. Green and red peaks indicate sense 
and antisense reads, respectively. Red bars represent RefSeq annotation of gene 
exons. Asterisks indicate the location of TSSs. Arrows indicate the orientation 
of coding strand transcript. Data were compiled from two biological replicates. 
b, Boxplot analysis of the level of expression of xTSS-RNAs at various genes 
reported to undergo recurrent AID-dependent translocations at DNA double- 
strand breaks generated within the Igh (left panel) or Myc (right panel) loci. 
Boxplots represent median values compiled from two biological replicates. 
Whiskers represent 99% of data values. **P < 0.01 (Wilcoxon rank-sum test). 
c, The list of 40 genes that show an overlap of translocation hotspots and xTSS- 
RNA expression (from Fig. 3c) was evaluated directly for xTSS-RNA levels (left 
panel) and mRNA levels (right panel). Statistical analysis was as described in 
b. **P < 0.01; NS, not significant (Wilcoxon rank-sum test). d, xTSS-RNA 
expression levels in Exosc3-deficient B cells at non-recurrent and recurrent 
AID-dependent translocation sites in the B-cell genome. Data were compiled
from two biological replicates. **P < 0.01 (Wilcoxon rank-sum test). 

e, Statistical analysis of the probability of identifying 40 random
xTSS-RNA-expressing genes solely on the basis of expression level. Ten thousand
control groups of genes were randomly selected that were expressed at levels
similar to those of translocation hotspot genes. Specifically, to generate one
random control group, we went through all translocation hotspots to find genes
with similar expression levels (difference of RPKM < 0.5), and randomly picked
one for each hotspot. Ten thousand gene lists were obtained, each containing 88
genes and sharing the same expression profile as the translocation hotspots
list. We then simulated the distribution of genes containing xTSS-RNA by
overlapping the random control groups with the actual xTSS-RNA gene list. The binomial fitting
(red curve) shows that the number of overlapping genes of real translocation 
hotspots is significantly higher than random controls. **P < 0.01 (binomial 
distribution). 
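
The expression-matched control procedure described in e can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' code: the function names, the toy RPKM values, the choice to exclude hotspot genes from the candidate pools, and the fixed random seed are all hypothetical. The sketch also reports a simple empirical P value in place of the binomial fit used for the figure.

```python
import random


def matched_control_overlaps(rpkm, hotspots, xtss_genes,
                             n_lists=10_000, max_diff=0.5, seed=0):
    """For each random control list, count how many of its genes carry xTSS-RNA."""
    rng = random.Random(seed)
    hotspot_set = set(hotspots)
    # One candidate pool per hotspot: non-hotspot genes whose RPKM lies
    # within max_diff of that hotspot's RPKM (exclusion is an assumption).
    pools = []
    for hotspot in hotspots:
        pool = [gene for gene, value in rpkm.items()
                if gene not in hotspot_set
                and abs(value - rpkm[hotspot]) < max_diff]
        if pool:  # hotspots with no expression-matched gene are skipped
            pools.append(pool)
    overlaps = []
    for _ in range(n_lists):
        control = [rng.choice(pool) for pool in pools]  # one pick per hotspot
        overlaps.append(sum(gene in xtss_genes for gene in control))
    return overlaps


def empirical_p(overlaps, observed):
    """One-sided empirical P value with the usual +1 correction."""
    hits = sum(1 for count in overlaps if count >= observed)
    return (hits + 1) / (len(overlaps) + 1)


# Toy data, invented purely for illustration.
rpkm = {"H1": 5.0, "H2": 10.0, "A": 5.2, "B": 4.9, "C": 10.3, "D": 20.0}
hotspots = ["H1", "H2"]
xtss_genes = {"H1", "H2", "A", "C"}

overlaps = matched_control_overlaps(rpkm, hotspots, xtss_genes, n_lists=1000)
observed = sum(gene in xtss_genes for gene in hotspots)
print(empirical_p(overlaps, observed))
```

With real data, rpkm would hold one expression value per gene and hotspots the 88 translocation hotspot genes. A small P value would indicate that the hotspots overlap the xTSS-RNA list more often than expression-matched controls do.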
Extended Data Figure 7 | RNA exosome substrate antisense transcripts are
expressed within gene bodies and regulatory regions containing
AID-induced translocations. a, Association of genes with xTSS-RNA expression,
antisense transcripts and AID-induced translocations. The xTSS-RNA and
antisense transcripts groups were compiled from four and two biological
replicates, respectively. b, Examples of genes with asRNA transcription
(Exosc3WT/WT in blue and Exosc3COIN/COIN in red) at regions that have been
shown to have translocations from the Igh locus (translocations indicated in
black). Data were compiled from two biological replicates. c, Translocations
present in the upstream regulatory regions of AID target gene Cd83 (top panel)
occur over regions of RNA exosome-sensitive antisense transcription. Data
were compiled from two biological replicates.
Extended Data Figure 8 | Genome-wide analysis of xTSS-RNA expression at
genes with AID, Spt5 and RPA occupancy. a, Scatter plot of antisense (left)
and sense RNA transcription (right) in Exosc3WT/WT and Exosc3COIN/COIN
transcriptomes. Data were compiled from two biological replicates. b, Genome-wide
analysis of xTSS-RNA expression at genes that are expressed and possess
or lack AID and/or Spt5 occupancy. Values beneath each group represent the
number of genes with indicated occupancy. P values were determined by
Wilcoxon rank-sum test. c, Overlap of genes with xTSS-RNA transcription 
(pink), recurrent AID-dependent chromosomal translocations (green) and 
RPA occupancy in the mouse B-cell genome (brown). The xTSS-RNA group 
was compiled from four biological replicates. **P < 0.01 (Fisher’s exact test). 
Extended Data Figure 9 | RNA exosome and AID recruitment to
divergently transcribed promoter regions. a, ChIP was performed using anti- 
H3S10ph or control IgG. Quantitative PCR and data analysis were performed 
as described in Fig. 4c. **P < 0.01 (t-test). b-d, Representative plots of highly 
expressed non-divergent genes used as controls for ChIP experiments in Fig. 4. 
These genes are Cmas (b), Fam107b (c) and March2 (d). Four biological 
replicates were performed. e, Exosc3 occupancy at divergent and non-divergent 
promoters. ChIP was performed using anti-Exosc3 (Genway) or control rabbit 
IgG. Quantitative PCR was performed using primers specific for sequences 
upstream of the indicated gene TSS. Data are represented as Exosc3 enrichment 
relative to input. Data represent mean values from three technical replicates. 
Error bars represent s.d. **P < 0.01 (t-test). f, g, CRISPR/Cas9-mediated 
deletion strategy of Cd83 (f) and Pim1 (g) xTSS-RNA-expressing regions in 
CH12F3 B cells. Locations of CRISPR/Cas9 guide RNAs (red markings), 
genotyping primers (blue triangles), ChIP primers (green triangles), and 
xTSS-RNA region-deleted CH12F3 clones. i, Cd83 and Pim1 mRNA
expression in xTSS-RNA region-deleted CH12F3 cells. Data represent mean 
values from three technical replicates. *P < 0.05 (t-test). j, Deletion of Cd83 
xTSS-RNA-expressing region impairs SHM. Parental CH12F3 or Cd83 xTSS- 
RNA region-deleted CH12F3 cells were transduced with lentiviral AID and 
mutation frequency was determined within a 488 bp region beginning 
approximately 150 bp downstream of the Cd83 TSS. All mutations were derived 
from unique clonal amplified sequences. Impairment of Cd83 SHM in Cd83 
xTSS-RNA region-deleted cells is disproportionately greater than the mRNA
expression change observed in i. The number of sequenced clones for parental and
Cd83 xTSS-RNA region-deleted CH12F3 was 69 and 102, respectively. 
Background mutation frequency was determined using uninfected control 
CH12F3 cells and subtracted from the mutation frequencies indicated. 

**P < 0.01 (proportion test). 
Extended Data Figure 10 | A model of RNA exosome recruitment to
divergently transcribed promoters or at DNA sequences that promote RNA
Pol II stalling. a, Divergent transcription of mRNA in the sense direction
recruits RNA exosome and AID following stalling due to various transcription
impediments (G-richness in IgH switch sequences is one example).
Transcription stalling leading to RNA exosome recruitment occurs more often
on the antisense strand due to formation of short asRNAs. Similarly, in the
body of transcribed genes, stalled RNA Pol II generates asRNA transcripts,
leading to RNA exosome and AID recruitment. b, Stalled transcripts either
close to the TSS or within the body of genes generate DNA-RNA hybrids.
These DNA-RNA hybrids contain RPA-coated ssDNA structures that are
targets of AID.
In this Letter, Figs 2 and 3 contained several minor errors, which have
now been corrected. In Fig. 2c, we did not include the possible path- 
way from deciduous and freezing unexposed to evergreen and freez- 
ing exposed. This omission slightly alters the relative likelihood of the 
different pathways out of the evergreen and freezing unexposed state 
(<2%), but the interpretation is the same. In Fig. 2d, we also note that 
the arrow leading from large conduits and freezing unexposed to large 
conduits and freezing exposed and the arrow leading from large con- 
duits and freezing exposed to small conduits and freezing exposed were 
switched when generating the figure. In general, the scale of the circles 
(persistence times) and arrows (transition rates) in Figs 2 and 3 were 
also found to be confusing. We have now corrected Figs 2 and 3 online 
such that the scale matches a discrete binning of the persistence times 
and transition rates for each character state combination. We thank
E. Edwards for bringing these issues to our attention. Finally, in Extended 
Data Table 3, we note an incorrect transition rate was provided for the 
transition from woody unexposed to woody exposed for the Super- 
rosidae; the transition rate should be 0.01, not 0.001, and this has also 
now been corrected online. 
CORRECTIONS & AMENDMENTS


CORRIGENDUM 
doi:10.1038/nature13877 


Corrigendum: Connectomic 
reconstruction of the inner 
plexiform layer in the mouse retina. 


Moritz Helmstaedter, Kevin L. Briggman, Srinivas C. Turaga, 
Viren Jain, H. Sebastian Seung & Winfried Denk 


Nature 500, 168-174 (2013); doi:10.1038/nature12346 


It has been brought to our attention that Supplementary Data 7, 
reporting the correspondence of our cell type definitions to those reported 
in the literature, contained sorting errors in the first two columns of the 
table. The correct table is shown in the Supplementary Information to 
this Corrigendum. 

We would also like to clarify that although some authors distinguish 
between the amacrine cell types A2 (receiving cone bipolar input) and 
AII (part of the rod bipolar pathway), we have used ‘A2’ in reference to 
the amacrine cell of the rod bipolar pathway only. Furthermore, our classification
of cone bipolar cell types CBC1 and CBC2 needs to be treated
with caution. To distinguish CBC1 from CBC2 we primarily used the
width of axonal stratification along the light axis, with CBC1 as the wider
cells (on the basis of the morphological sketch in ref. 1 (ref. 28 in original
Article)), and the mosaic fit as a secondary criterion. We used the prevalence
reported in ref. 1, and found reasonable but not perfect mosaics
for both CBC1 and CBC2 cells, and we interpreted this as confirmation
of our choice of assignment. We have, however, since discovered that
starting with the converse assumption, that the narrower cells correspond
to CBC1 rather than CBC2, but otherwise proceeding in the same way,
we find mosaics of similar quality. The alternative sorting of cells is:
CBC1 (n = 26, matrix IDs 393, 400, 403, 410, 412, 414, 415, 417, 418,
420, 421, 422, 424, 427, 430, 431, 432, 433, 436, 438, 440, 441, 444, 445, 
447, 449); CBC2 (n = 34, matrix IDs 390, 391, 392, 394, 395, 396, 397, 
398, 399, 401, 402, 404, 405, 406, 407, 408, 409, 411, 413, 416, 419, 423, 
425, 426, 428, 429, 434, 435, 437, 439, 442, 443, 446, 448). In fact, on 
the basis of the degree of axonal overlaps that remains with either sort- 
ing we cannot rule out the possibility that there may be a third sparse 
bipolar cell type among the CBC1 and CBC2 cells. 


Supplementary Information is available in the online version of this Corrigendum. 


1. Wässle, H., Puller, C., Müller, F. & Haverkamp, S. Cone contacts, mosaics, and
territories of bipolar cells in the mouse retina. J. Neurosci. 29, 106-117 (2009). 


CORRIGENDUM 
doi:10.1038/nature13841 


Corrigendum: A microbial
ecosystem beneath the West
Antarctic ice sheet


Brent C. Christner, John C. Priscu, Amanda M. Achberger, 
Carlo Barbante, Sasha P. Carter, Knut Christianson, 
Alexander B. Michaud, Jill A. Mikucki, Andrew C. Mitchell, 
Mark L. Skidmore, Trista J. Vick-Majors & the WISSARD 
Science Team 


Nature 512, 310-313 (2014); doi:10.1038/nature13667 


During the preparation of the manuscript, author Huw Horgan was 
inadvertently excluded from the list of authors for the WISSARD Science 
Team. The HTML and PDF versions of this Letter have been corrected. 
MOLECULAR BIOLOGY


Genetic touch-ups 


Simplified techniques have made the field of gene editing 
much more accessible to non-specialists. 


BY JEFFREY M. PERKEL 


Making precision changes in the
genetic code of living cells has now
become so easy that the power of 
genome editing can be harnessed by any- 
body with basic skills in molecular biology 
(see ‘Learning the ropes’). 
The ease is mainly down to the development
of two technologies that can be customized
to target specific DNA sites. The
technologies — known as transcription 
activator-like effector nucleases, or TALENs, 
and clustered regularly interspaced short pal- 
indromic repeats (CRISPR-Cas) — are both 
much simpler to use than earlier techniques 
and considerably cheaper and easier to make. 

This combination means that the field of 
genome engineering is much more accessible 
than it was a few years ago, when it required 
advanced expertise in techniques such as 
protein engineering, DNA repair and ways
to get nucleic acids into cells. The developments
have opened up job opportunities
along three axes: solving basic biological 
problems, developing improved technology 
and finding potential therapies for diseases. 

Eric Hendrickson, a biochemist at the 
University of Minnesota Medical School in 
Minneapolis, says that the development of 
CRISPR-Cas was like an “earthquake” in 
the life sciences. He had spent years trying 
to perfect a more complex editing system, 
but migrated most of his work over to the 
CRISPR-Cas system within a few years of 
its development. Despite having been in 
the business for 30-35 years, Hendrickson 
says that he has never seen anything sweep 
through science as rapidly as CRISPR-Cas 
has. 

The new ease in editing may be a double- 
edged sword, however, because what once 
was a rare skill set has now effectively 
become commonplace. “For the past decade, 
if you could go into any job interview and 
say, ‘And by the way, I can do gene targeting’,
that was always a big selling point,” he says.
Today, it holds much less sway. 

But it has also boosted the field. Huimin 
Zhao, a chemical and biomolecular engi- 
neer at the University of Illinois at Urbana- 
Champaign, says that genome engineering is 
one of the most active subareas of synthetic 
biology, his research focus. 

And Daniel Voytas, a plant researcher and 
director of the Center for Genome Engi- 
neering at the University of Minnesota, says 
that he screens postdoc applicants not for 
advanced editing skills but for what might 
be called genetic green fingers — the ability 
to genetically modify plant cells and grow 
them into functional plants. 


TECHNICAL JUMP 

Some techniques still require advanced 
expertise. Farjana Fattah, a postdoctoral 
researcher at the University of Texas South- 
western Medical School in Dallas, says 
that edits that replace one sequence with 
another require more technical know-how 
than those that simply knock out genes, for 
example. 

Fattah developed the ability to make such 
complicated edits while she was doing her 
PhD with Hendrickson, and hopes to capi- 
talize on it with a job in biotechnology. 

Skills beyond genome editing, such as 
protein engineering, are also necessary for 
those interested in designing the next
generation of editing tools. When hiring
postdoctoral researchers for such projects, 
says geneticist George Church of Harvard 
Medical School in Boston, Massachusetts, he 
likes to see experience with genome editing 
or related technology, but it is not crucial. 
Scientific creativity is, however. “As we're 
trying to develop transformative, disruptive 
technologies, maybe there's a slightly higher 
emphasis on people who think out of the box 
and are willing to fail quickly and move on,” 
he says. 

Those qualities are also in demand at 
companies that develop commercial edit- 
ing tools, such as Sigma-Aldrich of St Louis, 
Missouri. Sigma-Aldrich looks for can- 
didates with skills in bioinformatics, cell 
culture and genotyping of recombinant cells 
and animals, says Greg Davis, the compa- 
ny’s research and development manager for 
molecular biotechnology. But experience in 
using editing systems to make research tools 
is also a plus, Davis says. “Then you know 
that they understand the basics of the tech- 
nology coming in. And they can be ready to 
implement it immediately or improve on it 
once they come into the company.” 


DEVELOPERS 

Genome-editing technology is also 
becoming more popular in the therapeutics 
sector owing to its potential in reversing 
genetic disorders and aiding in drug development.
AstraZeneca recently started looking for people
for postdoctoral positions in precise genome editing
in Sweden and the United Kingdom. Mohammad
Bohlooly, associate director of research 
and development, says that the company 
uses genome-editing technology to create 
cell and mouse models for identifying and 
validating potential drug targets, and has 
ramped up its hiring of both postdoctoral 
fellows and research scientists. 

Sangamo BioSciences in Richmond, 
California, develops and uses the older zinc- 
finger protein (ZFP) technology to develop 
therapeutics. The company has grown from 
80 to about 100 employees over the past few 
years as it moved into clinical trials, says 
Philip Gregory, chief scientific officer and 
senior vice-president for research. 

The pool includes a large group of people 
who focus on protein design, Gregory 
says — a reflection of the fact that ZFPs 
are harder to work with than TALENs or 
CRISPRs — but the company is also inter- 
ested in researchers who understand the 
processes of DNA repair and gene regulation 
and can apply those to therapeutics. 

EDUCATION


Learning the ropes


The technical bar for genome editing
is now relatively low: you need a basic
knowledge of molecular biology,
some bioinformatics skills and a good
understanding of the mechanisms of DNA
repair. “Anyone with even master’s-level
skills in molecular biology can understand
the process of making the reagents and
could get started on a genome-engineering
project,” says Daniel Voytas, a plant
researcher and director of the Center for
Genome Engineering at the University of
Minnesota in Minneapolis.

But investment in mastering the
process, and identifying problems to
apply the techniques to, can yield new
skills, potential collaborators and job
opportunities. Many resources are now
available to help researchers do just that,
including web tutorials, short courses and
conferences on various aspects of the
technologies, particularly the newer ones,
known as clustered regularly interspaced
short palindromic repeats (CRISPR)
and transcription activator-like effector
nucleases (TALENs).


For the dedicated learner

Young researchers can establish genome-editing
credentials by finding a compelling
application in their own work: the tools
are affordable and relatively easy to use.
“Most labs would be delighted to have their
graduate student do a CRISPR experiment
as part of their thesis, and that makes
them very hireable,” says George Church,
a geneticist at Harvard Medical School in
Boston, Massachusetts. Opportunities for
practical experience include:

• Workshop on CRISPR-Cas gene
targeting in mice in Bar Harbor, Maine
(5-7 November 2014)
go.nature.com/eeydd6

• RNA Institute symposium on genome
editing with CRISPR-Cas in Albany, New
York (17-20 March 2015)
go.nature.com/mzdr5h

• Wellcome Trust course on genetic
engineering of mammalian stem cells
in Hinxton, UK (16-28 February 2015)
go.nature.com/ivpior


For the do-it-yourselfer

Here are some web-based resources:

• Practical guide to CRISPR
go.nature.com/xb3zqm

• Webinar on genome editing with CRISPR
go.nature.com/n7gezu

• Review of CRISPR-Cas systems
go.nature.com/yvedvr

• CRISPR developments
go.nature.com/cyedsr

• A video on how to use CRISPR-Cas9
go.nature.com/9zku7j

• TALEN-based genome editing
go.nature.com/pwfdec

• CRISPR-based genome editing
go.nature.com/vhzlog

• Review of genome editing
go.nature.com/uulwiz

• Portal for designing CRISPRs
go.nature.com/myqgyq

• Portal for designing TALENs
go.nature.com/wxpdmv

• Genome-engineering resources
go.nature.com/4vmv44


For the networker

Conferences are useful for making
contact with academics and industry
representatives who are knowledgeable
about genome editing. A Federation of
American Societies for Experimental
Biology meeting in June provided many
such opportunities, says biochemist Eric
Hendrickson at the University of Minnesota
Medical School in Minneapolis. Others
include:

• Keystone Symposium on Precision
Genome Engineering and Synthetic Biology
in Big Sky, Montana (11-16 January 2015)
go.nature.com/ij5xeh

• Cold Spring Harbor meeting on The
CRISPR/CAS Revolution in New York
(24-27 September 2015)
go.nature.com/rbpkeb

J.P.


Smaller biotech companies are also
growing. Editas Medicine of Cambridge,
Massachusetts, will be adding a significant
number of researchers in the next year or
two, says chief operating officer Alexandra
Glucksmann.

And CRISPR Therapeutics in Basel,
Switzerland, anticipates ramping up its
current base of half a dozen staff researchers
and off-site consultants to a dozen or so
staff researchers by the end of the year, says
chief executive Rodger Novak. It also plans
to double that by the end of next year.

It is still too early to say whether genome
editing will become a skill that, like PCR,
most molecular biologists must learn, or
a field of its own. But given the low bar to
entry, it is a skill that savvy life scientists
should consider learning and applying now,
to get a jump on the competition.


Jeffrey M. Perkel is a freelance writer based in 
Pocatello, Idaho. 


SCIENCE FICTION


THE METHOD


What it takes.


BY JON HURWITZ


Jake had hopes for an Oscar the year after
next, which would cement his position as
a bankable star. They’d reported him getting
$20 million for this movie. He wished!

He wrote the field equations on the
whiteboard at the back of their practice film
set, and this time managed to get halfway
through the second line.

“No!” His technical adviser, the exacting
Doctor Daker, flared his nostrils and ran a
hand back through his prematurely white
hair. “What did we say about using covariant
derivatives?”

According to the script, Jake needed
to write equations while simultaneously
speaking lines, then duplicate it exactly in
close-up. A foreign-language Oscar perhaps,
he thought, looking at the board. He went
back to the previous term and added in the
semicolon to Daker’s satisfaction before
carrying on. He had three months before
filming started to master the character of an
obsessed scientist who invents a subspace
drive, and by then he needed to think like a
physicist as well as write like one.

Daker was tutting again. What now?
Learning lines was never this tough.

“Perhaps you should start over,” Daker
suggested.

Jake took a deep breath. Best Actor Award,
he told himself.


Jake had covered the whiteboard behind him
in symbols that he now almost understood.
In front lay a mass of parts that Daker was
explaining. What did he just say?

“Are you seriously telling me real physicists
make their own equipment?”

Daker laughed. “Sure. It’s cheaper to fabricate
parts from a 3D printer and we don’t
normally get access to a Hollywood production
budget.”

“What about these?” Jake surveyed the
improbable components.

“I bought the cyclotron and the mass separator
for the film, but most of the other stuff
I’ve made over the years. There are some
big-ticket items coming I could never have
managed, including the superfluid cooling
injectors. It has to look impressive for the
cameras, right?”

“Can you teach me how to make some of
these?” If a real physicist did it, Jake wanted
to do it at least once for the experience. He
needed the emotional memory to draw on.

Daker’s nostrils flared. “First we should
get you assembling the parts you're going to
work with on set, no?”

“OK, but let’s make something later. I
need to know this stuff.” Jake wasn't as difficult
as his reputation suggested. You got it
right or you didn't, and he preferred to get it
right. Surely that wasn't so hard for others,
he reckoned, even when it was damned difficult
for himself.


Daker fussed as Jake bolted the injectors to the 
3-metre carapace, precisely opposite the inlets. 

“So you're a method actor? Stanislavski, 
right?” 

“Method yes, but Strasberg.”

Jake's fingers moved more certainly now 
that he understood how the larger pieces 
would have to fit together. He might have to 
slow it down a bit for the performance. Real- 
ity differed on screen; you found the truth 
behind the truth. 

“Strasberg’s method is better for scripted
Hollywood,” Jake explained, still working.
“You go to character motivation, rather than
asking ‘what would I do?’”

“Good,” said Daker, as Jake finished tightening
the last bolt. “Time to hook up the power.”

Jake hardly noticed how nervous Daker
looked. This was a step further than they’d
previously gone and he concentrated on connecting
the wiring.


March came and filming started. The practice
set had been dismantled and shipped
to the main studios, where they’d rebuilt
it with gaps for the cameras. Other spaces
were filled by the director, the technicians,
Daker and even the producers who had
come to watch the first day's shoot.

In his trailer Jake was becoming Doctor 
Han Selig, inventor of the subspace drive, a 
role that leaned on many of Daker’s man- 
nerisms, adding the hyper-realism only a 
method actor could bring to bear. They'd be 
filming interior scene 25, the completion of 
the drive, where Selig switched on his inven- 
tion for the first time. 

Jake walked from his trailer to the set and 
stood before Selig’s engine, his life’s work. Of 
course he was compulsive, even obsessive; 
how else could he achieve anything worth- 
while? Han Selig would show them what it 
took to be extraordinary. 

“Action!” 

He coupled the power plant to the main 
drive connectors, appearing hesitant, and 
flipped the switch. The 2-tonne engine 
bobbed up as it had in preparation and he 
allowed a small smile of triumph to grow, one 
he’d practised many times to ensure it would
match the close-up they’d film next. He ran a
hand backwards through his hair before lifting
the engine bodily upwards another metre,
where it remained, hovering in mid-air. 

He was too busy being Selig to react to the 
surprise on the watching faces. 


Doctor Daker had high hopes for a Nobel. 
As he watched, mirroring Jake’s triumphant 
smile, he fingered a 20-year-old rejection 
from Nature magazine that he’d brought with
him to sweeten the moment. “Unprovable
physics has no place in a peer-reviewed journal,”
that’s what it said, as did rejections from
other journals. How could he get proof?
“We cannot support experiments based on
unpublished physics,” the grant committees
had written.

Daker could never have afforded the 
nearly $2 million needed to prove his theo- 
ries, but he had raised the much smaller sum 
required to hire a great scriptwriter. 

The cameras recorded the first moment 
his engine rose in public and Daker hoped 
they’d captured the equations from the
whiteboard. It probably didn’t matter, he 
concluded after a time. He may have pub- 
lished first in Disney, but when word got out, 
surely Nature would reprint.


Jon is an IT analyst living in London. 
Educated in physics and bioinformatics, he 
writes science fiction for his own amusement 
and in the hope that it amuses others. 
It is no exaggeration to say that the annual Lindau Nobel
Laureate Meetings can be a life-changing experience
for many of the 600 or so young scientists who attend.
Researchers, all aged under 35, are selected from thousands of
applicants from more than 80 countries and, this year, some
were lost for words when asked to sum up the experience of
what it meant to spend a week mingling with their scientific
heroes on the German island of Lindau.

After all, where else can you rub shoulders with the
discoverer of HIV, the person who uncovered the genetic
foundations of cancer, or the scientist who risked his life to
prove that stomach ulcers are caused by a bacterium?

This year’s Lindau meeting, the 64th held since 1951, was
themed physiology or medicine and took place between
29 June and 4 July, with 37 laureates in attendance. For the first
time, there were more female young researchers than male.

Some laureates were familiar faces, such as Werner Arber,
for whom it was his 26th visit. Others, including Michael
Bishop, Jules Hoffmann and Barry Marshall, were new to
the experience. Despite a busy schedule, the laureates clearly
enjoyed exchanging ideas with the next generation.

Taking inspiration from the opening lecture by Randy
Schekman, who shared the 2013 Nobel prize for work on the
cell’s internal transport systems, we report on the part played
by autophagy in conditions such as cancer and Alzheimer’s
disease (page S2). There are discussions, initiated by
Nature Video and available at www.nature.com/lindau/2014,
between young researchers and laureates on the science
and ethics of ageing (S14) as well as Q&As with six laureates,
conducted and written by young scientists (S5).

We are pleased to acknowledge the financial support from
Mars, Incorporated in producing this Outlook. As always,
Nature has sole responsibility for all editorial content.
MOLECULAR BIOLOGY


Remove, reuse, recycle 


Waste removal is not usually described as sexy, but the once-neglected field of autophagy 
— which plays a part in cancer and other diseases — is a hot topic in biomedical research. 


BY MICHAEL EISENSTEIN 


When Ana Maria Cuervo began
researching her thesis in autophagy,
a cellular recycling mechanism,
little did she know that two decades
later she would be working in one of the most 
dynamic fields of medical research. Randy 
Schekman, winner of last year’s Nobel Prize 
in Physiology or Medicine, even chose to talk 
about autophagy in his opening address to the 
37 laureates and 600 young scientists at this 
year’s meeting instead of cellular trafficking 
— his prizewinning work. Cuervo is accus- 
tomed to this rise in interest. “I did my thesis on 
autophagy in the early 1990s when autophagy 
wasn't cool,” says Cuervo, who is now co-direc- 
tor of the Einstein Institute for Aging Research 
at the Albert Einstein College of Medicine in 
New York City. “When I finished, everybody 
told me to change fields because autophagy was 
a dead end,” she confesses. Studies have proved
this prediction to be spectacularly wrong. 


Autophagy was once considered to be little 
more than a cellular recycling bin — a process 
by which cells break down unwanted biomol- 
ecules into raw materials. But more recent 
research has revealed that autophagy is, in fact, 
a nexus for the cellular stress response and a 
failure point for many diseases. In the past 
ten years, researchers have made connections 
between autophagy and the immune response, 
cancer, neurodegeneration and ageing, says 
Daniel Klionsky of the University of Michigan 
in the United States. “The field just exploded.” 


A PROMOTION FROM HOUSEKEEPING 

There are different types of autophagy, but the 
best-understood pathway is known as ‘macro- 
autophagy’ — a bulk mechanism for gather- 
ing up and degrading proteins, organelles and 
other cellular materials. The process begins 
with the formation of a double-membrane 
structure known as a phagophore, which elon- 
gates and engulfs nearby cellular components 
(see ‘Eating up the cell’). 
Autophagy was discovered in the 1960s,
based on microscopic observations of selec- 
tive degradation of cellular material within the 
lysosome (see ‘A history of autophagy’). Over 
time, scientists accumulated evidence that this 
process helped cells to deal with nutrient-poor 
conditions, to eliminate excess proteins and 
even to remove entire mitochondria — the 
cell’s metabolic power plants. However, most 
functions seemed to fall under the umbrella 
of basic maintenance, and autophagy research 
remained a niche field. 

The turning point that showed autophagy 
was not simply cellular housekeeping came 
in the mid-1990s, when a number of proteins 
(now known as Atg proteins) that collectively 
mediate the formation and maturation of the 
phagophore were reported. Since then it has 
become clear that the Atg machinery inter- 
sects with physiological processes underlying 
an array of disorders, but scientists are still 
struggling to figure out the conditions that 
autophagy prevents or promotes. 


CANCER CONTROVERSY 

Autophagy seems to provide a crucial bulwark 
against genetic and biochemical damage — for 
example, by eliminating damaged mitochon- 
dria that would otherwise leak toxic molecules 
into the cell. As such, it is perhaps unsurprising
that cancer was the first disease to be linked 
with autophagy. However, current evidence 
suggests that autophagy can act as both an ena- 
bler of and a protector against tumour growth, 
creating some debate in the field. 

In 1999, Beth Levine and her colleagues at 
Columbia University, New York, showed that 
a protein called beclin-1 suppresses tumour 
activity in humans and promotes early forma-
tion of the phagophore¹. The group also found
that several cellular pathways that drive tumour 
growth inhibit autophagy, either by preventing 
activation of beclin-1 or by interfering with 
other Atg proteins. Levine is waiting for proof 
before declaring that autophagy failure itself 
drives tumour growth, but she believes it makes 
for a compelling hypothesis. “The general view 
is that autophagy plays a protective role against 
the development of cancer,” she says.

However, some scientists believe that 
autophagy can also help advanced tumours to 
thrive by allowing cancerous cells to cope with 
the stress associated with competing for lim- 
ited nutrients and oxygen, not to mention the 
toxicity caused by radiation or chemotherapy. 
Autophagy inhibitors could, therefore, render 
established cancers more vulnerable to treat- 
ment, says oncologist Ravi Amaravadi at the 
University of Pennsylvania in Philadelphia. 
“The overarching theme is that autophagy is an 
adaptive stress response that protects the cancer 
cell in advanced disease,” he says. 


KEEPING A CLEAR MIND 

But it is not only cancer that is linked to the 
failure of autophagy — it also seems to play a
key part in neurodegenerative disorders such 
as Alzheimer’s, Parkinson’s and Huntington’s 
diseases. These conditions are characterized 
by the formation of dense protein aggregates, 
which point to some sort of failure in cellular 
housekeeping, but disruptions vary consider- 
ably between the conditions. 

For example, neurons in Alzheimer's patients 
exhibit increased numbers of autophagosomes, 
the membranes that enclose the cell compo- 
nents before they are broken down, yet they can 
no longer fuse effectively with the lysosome. 

Although the roots of Alzheimer’s pathology 
remain unclear, with toxicity linked to accumu-
lation of two proteins called tau and amyloid-β
(Aβ), autophagic failure could provide a rea-
sonable explanation for either pathway. “At
late stages of disease you get what looks like an 
autophagy blockade that might compromise 
the whole process,” says neuroscientist David 
Rubinsztein at the University of Cambridge, 
UK. “That's going to affect not only tau and Aβ
clearance but also removal of damaged mito- 
chondria and other processes.” 


By contrast, some forms of Parkinson’s 
are associated with disruptions in a paral- 
lel autophagy pathway called chaperone- 
mediated autophagy in which specific proteins 
are delivered directly to the lysosome for deg-
radation by means of a protein called LAMP2A
without involvement of the autophagosome. 

One of the proteins normally removed 
by this process is α-synuclein, the plaque-
forming protein associated with Parkinson's.
Mutant forms of the protein or an excessive 
production of it can gum up the system and 
cause a gradual but steady decline in neuronal 
health. “Chaperone-mediated autophagy 
cannot remove the molecules at the normal 
rate, and the protein begins to accumulate,” 
says Cuervo. The normal autophagic process 
can compensate to a certain extent. However, 
as Rubinsztein and others have observed, 
α-synuclein can also exacerbate the condition.


CONSTRUCTIVE FEEDBACK 

The impact of autophagy goes beyond the 
confines of an individual cell — this process 
is also used to regulate metabolic function 
throughout the entire body. Cuervo and her 
team recently found² that the liver helps to
manage metabolism by using chaperone- 
mediated autophagy to selectively destroy the 
enzymes that convert sugar into energy. This 
is crucial, says Cuervo, because otherwise the 
liver becomes a “selfish organ” that uses all 
the glucose for itself at the expense of other 
tissues. Along with her colleague Rajat Singh, 
she has also found³ that nutrient-sensing
functions mediated by autophagy help the 
brain to convey that it is time to eat by switch- 
ing on appetite signals and switching off those 
that indicate satiety. 

Elements of the autophagy machinery also 
act as a line of defence against viruses and bac- 
teria by diverting would-be cell hijackers to 
the lysosome for destruction. Microbiologist 
Vojo Deretic at the University of New Mexico in 
Albuquerque hypothesizes that the autophagy 
machinery may have served as a primordial 
form of immunity in early evolutionary history, 
by helping the body to distinguish between 
molecular signatures that represent foreign 
threats and those that are indicators of ‘self’ 
and should be ignored. 

Many pathogens have evolved strategies that 
can sabotage autophagy, which Deretic first 
encountered while attempting to understand 
how Mycobacterium tuberculosis lives inside 
immune cells. He found⁴ that the bacteria were
escaping destruction by selectively attacking a 
molecule that would otherwise transport them 
to the lysosome. Likewise, Levine has observed⁵
that the herpes virus thwarts autophagy to sur- 
vive within neurons. “It has a protein that binds 
to the beclin-1 protein and blocks its function,” 
she says. “This is not necessary for viral replica- 
tion in vitro but is essential for replication in 
neurons, and this meant that viral evasion of 
autophagy was necessary for disease.” 


EATING UP THE CELL 


Autophagy is part of a cell’s normal function, 
removing proteins, damaged organelles and other 
unwanted material. Failure of the system is 
implicated in a number of conditions and ageing. 

With so many crucial processes seemingly
converging on a single cellular pathway, the 
expectation is that failures in autophagy have 
far-reaching consequences throughout the 
body. The evidence now strongly suggests that 
ageing is associated with a decline in autophagy, 
and some researchers are intrigued by the strik- 
ing overlap among conditions that are associ- 
ated with both ageing and autophagy, such as 
diabetes, cancer and neurodegenerative dis-
ease. Cuervo and her colleagues have found
evidence that chaperone-mediated autophagy 
might be an important factor in healthy human 
ageing. For instance, her team learned that the 
receptor in this pathway (LAMP2A) normally 
decreases with age, and therefore reduces the 
cell’s ability to degrade proteins, which Cuervo 
believes could increase the risk of metabolic 
diseases. This initiates a vicious circle, whereby 
failure to control enzymes that break down 
sugar and fat leads to their steady accumulation 
in the body which, in turn, further suppresses 
autophagy. The inability of the cell to main- 
tain its internal environment could be linked 
to other ageing-associated disorders, too. “It's 
like my mother used to say: in a clean house, 
everything works better,” says Cuervo. 

Conversely, other tricks to boost longevity 
seem to demand healthy autophagic function. 
For example, caloric restriction — in which 
subjects greatly reduce their food consump- 
tion without crossing the line into malnutri- 
tion — has been strongly linked with increased 
lifespan in many animal models. These same 
physiological conditions also stimulate 
autophagy, offering tantalizing evidence that 
these two processes — autophagy and the 
longevity gains associated with restricted 
caloric intake — may be linked. Research from 
Levine’s group has also shown that exercise 
can stimulate autophagy, and she speculates 
that our well-fed and sedentary contemporary 
lifestyles may suppress our capacity to maintain 
the high level of autophagy that helped to keep 
our ancestors healthy. 


HUNGRY FOR NEW THERAPEUTICS 

The potential link between increased 
autophagy and better health could be good 
news from a therapeutic perspective. The 
Levine group has developed a promising 
molecule that can stimulate autophagy, pro- 
tecting mice against otherwise-lethal viral 
infections and blocking the accumulation of 
proteins associated with neurodegenerative 
disease in cultured cells. 

Rubinsztein’s team has obtained promising 
preliminary results in mice with an autophagy- 
stimulating drug called rilmenidine, which has 
already been approved for treating high blood 
pressure in the United States and Europe. The 
drug is being tested in an ongoing clinical trial 
for safety in patients with Huntington's disease, 
and Rubinsztein hopes to move towards effi- 
cacy trials in patients with early stage neuro- 
degenerative disease — an area where many 
clinical researchers see the greatest promise in 
autophagy-targeting therapeutics. 

Several pharmaceutical companies, includ- 
ing Novartis, Pfizer and Millennium, are test- 
ing the waters, primarily focusing on inhibiting 
autophagy to make cancers more susceptible 
to chemotherapy treatment. In August 2014, 
Amaravadi and his colleagues published half 
a dozen phase I trials in which they paired
different cancer treatments with hydroxy-
chloroquine, an antimalarial drug that also
impairs lysosomal degradation. Although the
results were ambiguous in terms of efficacy, the
safety profile seems favourable. Amaravadi also
reported evidence of stalled autophagy in blood
cells and tumour tissue from patients treated
with the highest doses of hydroxychloroquine,
suggesting that this drug or a related compound
might be able to thwart a mechanism by which
cancer eludes destruction.


BACK IN TIME
A history of autophagy


The story of autophagy begins with
Belgian cell biologist Christian de Duve,
who shared the Nobel prize in 1974
for his exploration of the structural
and functional organization of the cell.
De Duve discovered the lysosome, an
acidic, membrane-bound compartment
within the cell that is loaded with
enzymes that can digest
biomolecules. In the 1960s, scientists
learned that proteins and structures
called organelles within cells were being
scooped up from the cytoplasm and
delivered to the lysosome for destruction
and recycling. De Duve coined the term
‘autophagy’ to describe such cellular
self-cannibalization.

Early studies linked autophagy to
the body’s ability to sense nutrients,
suggesting that the process enables cells
to obtain raw materials during starvation.
Once this model was established, interest
in the subject waned. Things changed
in the mid-1990s when researchers
began to untangle the mechanisms that
drive autophagy. From studies in simple
organisms, such as yeast, scientists
built genetic and functional maps of the
machinery used in autophagy. It became
clear that autophagy was conserved
throughout evolution and served a more
crucial purpose than just providing
emergency rations to cells. M.E.


Given the ambiguous role of autophagy 
in helping or hindering cancer, experts 
have expressed concern that the genetic 
heterogeneity found within a typical tumour 
could make cancer too challenging a target for 
such a broad therapeutic approach. “I’m not 
optimistic that this pro-survival function of 
autophagy is going to be a good therapeutic in 
all or even most cancers,” says Levine. At least 
one study⁶ suggests that inhibiting autophagy
might instead provoke more aggressive tumour
growth, although another study⁷ has contested
those findings. For now, this remains a topic of 
considerable debate, and Amaravadi hopes to 
gain deeper insights in an upcoming phase II 
trial in patients with pancreatic cancer. “This 
is very important because it’s randomized, so
if there’s a signal we'll know that it’s due to the
hydroxychloroquine,” he says.

From a therapeutic perspective, hitting the 
wrong target could have dire consequences. “If 
you’re having problems with autophagosome
clearance, as has been shown with Alzheimer’s,
then a drug that promotes autophagosome
formation will just create more vesicles that
aren’t going anywhere and make a bad traffic
jam worse,” says Cuervo. Furthermore, some
viruses actually make use of the autophagy 
machinery to assist in replication, so the same 
drug that thwarts, say, herpes might encourage 
poliovirus proliferation and release. 

Additionally, many of the drugs being tested 
affect autophagy either incidentally or in con- 
junction with other cellular pathways, making 
it harder to determine whether autophagy is the 
culprit or the cure for a given condition. 


BACK TO BASICS 

Deretic has obtained promising early data from 
a compound that may help to contain the pro- 
liferation of HIV by means of autophagy, but 
wants to get a better insight into how the mole- 
cule works before getting too excited. “We have 
to be very careful about how we interpret the 
data and what we expect to see before we even 
start the experiment,” he says. “Is it an inducer
or an inhibitor, and is it driving the whole pro- 
cess or just half of it? A lot of screening data stop 
short of answering these questions.” 

These questions become even harder 
to answer in the clinical setting, where 
researchers often rely on proxy indicators to 
glean static snapshots of a highly dynamic 
process. Klionsky has worked with many of 
the field’s top researchers to devise best prac- 
tices for studying autophagy, but it can still be 
fiendishly difficult to determine how a given 
experimental manipulation is altering the pro- 
cess — especially when one is targeting cells 
deep within the brain or liver. 

For this reason, some of the most important 
near-term studies in autophagy will be basic 
research efforts that monitor the nuts and 
bolts of the process. “We need to understand 
how autophagosomes are built, what regulates 
the way they form and what regulates their 
itinerary within the cell and fusion with lys-
osomes,” says Rubinsztein. “Having that toolkit
expanded will give us more potential insights 
into links with different types of disease.”


Michael Eisenstein is a freelance journalist 
based in Philadelphia, Pennsylvania. 


1. Liang, X. H. et al. Nature 402, 672-676 (1999).
2. Schneider, J. L., Suh, Y. & Cuervo, A. M. Cell Metab. 20, 417-432 (2014).
3. Rubinsztein, D. C. EMBO Rep. 13, 173-174 (2012).
4. Vergne, I. et al. Proc. Natl Acad. Sci. USA 102, 4033-4038 (2005).
5. Orvedahl, A. et al. Cell Host Microbe 1, 23-35 (2007).
6. Rosenfeldt, M. T. et al. Nature 504, 296-300 (2013).
7. Yang, A. et al. Cancer Discov. 4, 905-913 (2014).


Q&A Jules Hoffmann 
Fighting fit 


Jules Hoffmann shared the 2011 Nobel prize in Physiology or Medicine for discoveries in 
the activation of innate immunity against bacteria and fungi in fruit flies. Now based at the 
Institute of Molecular and Cellular Biology at Strasbourg University in France, Hoffmann 
talks to Adam and David Tarnoki about how to use the immune system to kill cancer cells.


What is our biggest health threat today? 

One of the most important discoveries in 
medicine was probably vaccination. For most 
of human history, people died from infec- 
tions. This is now largely under control and 
average life expectancy has doubled in the past 
100 years or so. However, we still do not have 
vaccines against a number of very important 
pathogens, such as HIV or Plasmodium, the 
agent of malaria, and we also have some vac- 
cines against established pathogens that do not 
fully protect people. With resistance against 
antibiotics becoming an increasing problem, 
vaccination has to be improved accordingly. 
In addition, we now face the problem of an 
ageing population in which cancer, neuro- 
degeneration, stroke and cardiovascular dis- 
eases are the major killers. Obesity is another 
key issue. Finally, we have to be careful about 
the effect of new materials or environmental 
toxins on our physiology in general, includ- 
ing our immune system, but we do not have to 
panic about this. 


Will we eventually be able to stimulate the 
immune system to kill cancer cells? 

This is a very important emerging field, and
there is great hope that we will understand
what induces an immune response against 
cancer cells. When you kill cancer cells using 
chemotherapy, those cells leak large numbers 
of molecules, some of which are thought to 
induce antibody formation against cancer 
cells. The immune system has checkpoints — 
proteins or inhibitory pathways — that prevent 
lymphocytes from overreacting and attacking 
normal tissue. The rationale here is that allevi- 
ating or inhibiting their functions in tumours 
will make reactions of the immune cells more 
aggressive and efficient. Indeed, clinical tri-
als are underway indicating that this can be a
promising avenue in curing some cancers. 


What is the secret to conducting Nobel 
prizewinning science? 

Science is a very stressful job because you have 
to choose the right field, get good results and 
then publish those results before your compet- 
itors. It demands full engagement and an enor- 
mous amount of work, so it is healthy to have 
other cultural interests and also a nice family 
life. I met my future wife when she was hired 
to work in our laboratory by my thesis advisor. 
It is very good when you have a partner who 
understands and shares your commitment.

Intellectual freedom is also crucial. From 
very basic, curiosity-driven research we ended 
up doing things that eventually turned out to 
be interesting for medicine. But we did not 
anticipate this when we set out. Basic science 
makes you ask questions and find results that 
suddenly open up to something that nobody 
knew before. 


What advice do you give to your students? 

I advise young students to choose a good 
subject and a good supervisor. In addition, 
I encourage them to be aware of all the pro- 
gress in their field, particularly regarding 
techniques. For example, in our research we 
had to immunize 100,000 flies individually 
in order to identify one inducible antifungal 
peptide, drosomycin, whereas today 20 would 
be enough because the technique has evolved 
so dramatically. Also, I tell them not to stick 
to the established techniques in their field: 
be open and interact with other fields. I was 
trained as a humble zoologist, but we had to 
get involved with cellular biology, biochem- 
istry, analytical chemistry, molecular biology 
and molecular genetics in order to achieve 
our research goals. Finally: work hard. My 
grandparents were butchers on one side and 
farmers on the other, and they worked very 
hard indeed. 


Do you always think and behave scientifically? 

I recently met some researchers at the Dead 
Sea in Israel, who had interesting results: they 
had cured three people with psoriasis and 
they wanted my opinion on it. I cautioned that 
because they did not have a full cohort show- 
ing the way the volunteers had been treated in 
the salty environment of the Dead Sea com- 
pared with a control group, they could not 
be sure that the subjects had been cured
because of their work. This is scientific
thinking, and it certainly influences the way I 
behave, but it’s not something you have to do 
all of the time. Some things I do don’t make 
much scientific sense. I choose not to drink 
alcohol at lunchtime, for instance, but in the 
evening will enjoy a good French wine.


Adam and David Tarnoki are identical twins
working in the Department of Radiology and
Oncotherapy at Semmelweis University in Budapest.
They revived the Hungarian twin registry
and perform twin studies in areas that
include atherosclerosis, respiratory
diseases and anthropometric traits
to try to understand the epigenetic
background of these diseases.


Q&A Barry Marshall
A bold experiment 


Laureate Barry Marshall, professor of clinical microbiology at the University of Western Australia in Perth, tells Meghan Azad why he risked his 
health to prove his theory about the link between stomach ulcers and bacteria. He shared the 2005 Nobel prize with Robin Warren for discovering 
the stomach-dwelling bacterium Helicobacter pylori and for proving that it is this microorganism, not stress, that causes most peptic ulcers. 


What sparked your interest in science
and medicine?

From the first day I ever saw a book I was very
keen to read. My father was a tradesman, so 
I read about motor mechanics, electrical 
equipment and even thermodynamics, and 
my mother was a nurse, so she had anatomy 
and physiology books. Finding out how things 
worked was always a natural thing. I didn't 
intend to go into research but it was part of my 
medical training. I could have worked on lots 
of things besides the bacterium Helicobacter 
pylori, but that was the one that really took off. 


What drew you to ulcers and H. pylori? 

The conventional wisdom was that people 
developed ulcers because they were suffering 
from stress, which was thought to increase 
gastric-acid secretion to the point at which the 
stomach lining breaks down and a peptic ulcer 
forms. I was sceptical that stress caused physical 
diseases, and I certainly was not prepared to lie
to patients by telling them that. So I looked for a
more evidence-based cause. Every medical and 
microbiology textbook at the time stated that 
the stomach was sterile, so nobody had thought 
of doing a culture or looking for bacteria with 
a simple Gram stain, a laboratory technique 
used to identify species of bacterium. If they 
had, they would have found H. pylori in five 
minutes! There were a few paradoxical things 
that made H. pylori hard to find — for example, 
it is often not detectable in the vicinity of an 
ulcer. In fact, we had cultured the bacterium 
months before we realized that the species was 
important in the formation of ulcers. 


When did you realize your work might be 
worthy of a Nobel prize? 

Robin Warren and I were jointly awarded the 
Nobel prize. We first identified the association 
between bacteria and ulcers in late 1982, and 
this was followed by a period of hypothesis 
testing and extrapolation. In April 1983, we 
carried out an experiment to prove that we
could kill Helicobacter with ulcer drugs and I 
knew then that we were almost certainly on 
the right track. Then we had our first paper 
published in The Lancet and we went out to 
celebrate. Robin's wife said that we might win 
the Nobel prize and we joked that it might hap- 
pen within a couple of years, but I am glad it 
didn't. It would be difficult to have won a Nobel 
so early in your career — I think you would 
develop a big inferiority complex. 


You swallowed a culture of H. pylori to prove 
your hypothesis. What led you to do this, and 
what did your family and colleagues think? 

I was becoming increasingly frustrated 
because I was successfully treating stomach-
ulcer patients with antibiotics but couldn’t
convince other doctors to use this approach 
without solid experimental evidence. I tried to 
infect piglets for six months, but piglets grow 
quickly, so it was a tough experiment to do. 


Without data proving that I could reproduce 
an ulcer by infecting an animal with H. pylori, a 
human experiment was the only option. When 
I decided to drink the Helicobacter culture I felt 
a bit embarrassed, and I didn’t really discuss it
with my bosses in case they forbade me to do
it. But I suspect they knew. I had an endoscopy
beforehand to check that my stomach was
normal and to establish a baseline, and my boss
said: “Barry, I’m not sure why you asked me to
do this endoscopy, and I don’t want you to tell
me.” I did not expect to develop any
symptoms, but I did become ill with vomiting 
and bad breath. A further endoscopy revealed 
the infection, proving that a healthy person 
could be infected by Helicobacter. Of course, 
there are plenty of things that can go wrong in 
a single self-experiment, and it is very doubt- 
ful that such a study would get published these 
days — even back then it was a bit of a stretch. 



Even after your self-experiment, the medical 
community remained sceptical that H. pylori 
was connected to stomach ulcers. How did you 
finally convince them? 

We were keen to present our data and 
announce that we had discovered the cause of 
ulcers, so we submitted our paper to the Aus- 
tralian Gastroenterology meeting in 1983. It 
was rejected. Fortunately, my boss at the time 
had some experience with Campylobacter, 
which was becoming a popular explanation 
for infectious colitis, or inflammation of the 
colon. Helicobacter looked similar, so I spoke to 
a Campylobacter expert in Britain and we sent 
him some cultures. He grew them and became 
excited about it, too. Then, in 1984, we went to 
a meeting of microbiologists, who are always 
interested in any new microbe, and things 
really took off after that. It took a few more 
years to gain support from gastroenterologists. 


There is a growing body of evidence that
infection with H. pylori during childhood may 
protect against immune diseases such as 
asthma and allergies. What is your take on this? 
For the past 30 years we have had the hygiene 
hypothesis, which states that a lack of early 
childhood exposure to microorganisms dis- 
rupts the natural development of the immune 
system. We know that hygiene levels have 
increased significantly during the past cen- 
tury and that allergic diseases are on the rise, 
but linking these trends is difficult. We know 
that parasitic worms, which are still com- 
mon in Africa but no longer in developed 
countries, suppress the immune system. 
Infection with Helicobacter used to be com- 
mon, but since the twentieth century that 
has been declining in developed countries. 
Finnish populations show decreased immu- 
noglobulin E, the antibody linked to allergies, 
in people with Helicobacter, and in New York
City, studies have found that children with
Helicobacter have a lower risk of developing
asthma, eczema or any kind of allergic dis-
order. These results are tantalizing, but other
studies have not necessarily found the same
thing.

Figure: Stomach cells (stained blue) and Helicobacter pylori (yellow), a bacterium that causes peptic ulcers.


Stomach ulcers were once firmly believed to 
be non-infectious diseases, but you proved 
that a microbe was responsible. Will other 
long-term diseases turn out to have an 
infectious cause? 

As far as I am concerned, everything is envi-
ronmental until you convince me that it is
genetic. Take rheumatoid arthritis: we do not 
know the aetiology of it but we have got expen- 
sive treatments similar to the way we used to 
prescribe acid blockers for ulcers. Eventually 
we will figure out the actual mechanism that 
triggers this cascade of immune problems 
in rheumatoid arthritis — maybe it is a viral 
infection. Genomic and microbiological stud- 
ies are extremely powerful here. For example, 
when my grandchildren first started mixing 
with other children at playgroups, they were 
taking home a new virus every week. We need 
to collect samples and ask what those viruses 
are so that 20 years from now, when some of 
those kids develop serious illness, we can look 
back at their microbiologic history. There are 
a lot of data that need to be collected and there 
are fantastic research opportunities that will 
help to solve those problems. 


I understand that you are developing an edible
vaccine made from H. pylori. 

Yes, although it has been harder than we 
thought. The idea is that you engineer an 
H. pylori strain that is deficient in some way 
and cannot give you permanent colonization. 
Then you clone some extra DNA into it, so that 


it could produce a useful peptide analogous 
to, for example, an influenza vaccine antigen. 
I expect that one day such oral vaccines will 
be available as food products in the supermar- 
ket, rather than requiring a needle. We are also 
working on probiotics related to H. pylori in 
clinical trials, and I have co-authored a paper 
looking at the migration of humans around 
the world based on variations in the H. pylori 
genome. Show me your H. pylori and I can tell 
you where you came from! 


What advice do you have for young scientists? 
First, do what you like to do, because turning 
up every day for a job you do not enjoy feels 
like a death sentence. Second, do not be afraid 
to sacrifice salary to do something that you 
are interested in. Third, keep some balance in 
your life — most of your papers are going to get 
rejected initially, and occasionally you’re going
to feel down, so it is good to have a partner 
with an objective perspective. 


If you had to be a microbe, which one would 
you be and why? 

Helicobacter pylori, because I would have no 
competition!


Meghan Azad is an assistant professor
at the University of Manitoba and
Manitoba Institute of Child Health in
Winnipeg, Canada. Her research with the
Canadian Healthy Infant Longitudinal
Development (CHILD) study is focused on
the early-life origins of chronic diseases and
the gut microbiome.


Q&A Françoise Barré-Sinoussi


HIV adversary 


Françoise Barré-Sinoussi and Luc Montagnier were jointly awarded the 2008 Nobel prize in
Physiology or Medicine for their discovery of HIV in 1983. Three decades on, Barré-Sinoussi
is director of the Retroviral Infections unit at the Pasteur Institute in Paris. Here, she tells Iria 
Gomez-Tourino about the latest strategies to combat the virus. 


HIV was discovered more than 30 years ago. 
How far have we come since then? 

The main achievement after the discovery of 
HIV was the diagnostic test, which meant that 
we could prevent transmission of the virus 
by blood and blood derivatives. The next big 
steps were the prevention of mother-to-child 
transmission using the antiretroviral treatment 
AZT in 1994 and the advent of potent com- 
binations of antiretroviral therapies in 1996. 
These are both good examples of what we call 
translational science, whereby basic knowl- 
edge is used to develop tests and treatments 
for the benefit of patients. 


It is estimated that for every HIV-infected 
person starting therapy two individuals are 
newly infected. What are we doing wrong? 
People are still really scared about being tested 
for HIV, even if they know that there is a treat- 
ment for it. In my experience, people worry 
that others could think they are drug users or 
sex workers and are afraid of being rejected
by society. Unfortunately, this stigma still exists
not only in resource-limited countries but also 
in countries such as France. 


Does the solution lie in better education or 
further research into treatments? 

Education is part of prevention, care and treat- 
ment. We can’t say prevention is more impor- 
tant than treatment or vice versa. If we do not 
treat the 35 million people who are already 
infected, the epidemic will continue. The 
treatment itself is also prevention, as we can 
reduce the transmission to others. We should 
also campaign for the use of existing preven- 
tative tools, such as the condom, but also for 
the development of new ones. Earlier this year 
there were some encouraging preliminary 
results based on a single injection of long-lasting
antiretrovirals, monthly. This kind of technology
could certainly be a breakthrough.

NATURE.COM
Four films with laureates and young students:
go.nature.com/uzypaz

To what extent is religion the cause of more
people becoming infected? 

Religion is one of many factors, but it is an 
important one. When Pope Benedict XVI 
claimed [in 2005] that condoms are not the 
solution for HIV, this had a really bad impact 
on African Catholic countries and this is really 
a shame. We also have some countries drawing
up homophobic legislation under the influence 
of religious dogma, but such measures will not 
reduce HIV infection. However, I have been 
in many places where local religious leaders 
are doing a remarkable job informing people 
about the risks and encouraging them to pro- 
tect themselves. 


What is the most promising route towards a 
cure for HIV infection? 

In my opinion, remission, which means that 
the virus is still present in a patient’s body but 
controlled so it does not replicate, is more 
likely to be achievable than a complete eradica- 
tion. We already have examples in which very 
early treatment after the infection has led to 
such remission. 

The VISCONTI patients [a group of 14 
patients in France who were all given antiretro- 
viral drugs soon after becoming infected] 
maintained a tight control of HIV replication 
several years after treatment was stopped. Also, 
the ‘Mississippi baby’ [an infant treated imme- 
diately after she was born with HIV] was able 
to maintain virological control of her infection 
for more than two years after the medication 
was stopped. Sadly, in this case the infection 
rebounded recently. We need to develop better 
tools to detect and measure the persistent virus. 


Why is a vaccine for HIV proving so elusive? 
There are lots of reasons. One is that the devel- 
opment of broadly neutralizing antibodies is 
very slow. Being highly variable, the virus can 
escape easily from the control of the immune 
system and the infection is very rapid, resulting 
in abnormal alteration of the immune defence. 
Vaccines are efficient and very often you still 
have very low levels of replication, which is 
good because it re-stimulates the immune 
system. In the case of the HIV antigen, 
re-stimulation can also be bad because trace 
amounts of antigens that are harmful to the 
immune system will prevent the vaccine from 
working. We have a list of antigens that can be 
harmful, but we don’t know which antigens ini- 
tiate the abnormal signalling in immune cells. 

A real breakthrough was the use of an SIV 
[simian immunodeficiency virus — the non- 
human primate equivalent of HIV] vaccine 
candidate using cytomegalovirus (CMV) as a 
vector. This CMV-based SIV vaccine is able to 
induce very efficient immune responses and to 
clear SIV infection in macaques. Recent results 
also show that a cocktail of broadly neutral- 
izing antibodies in mice and macaques can 
efficiently suppress HIV plasma viraemia and 
reduce proviral DNA.

In 2012 the International AIDS Society
published seven priorities for HIV research. 
What has been the impact of this strategy? 
We decided to launch the Towards an HIV 
Cure initiative to stimulate and coordinate 
international efforts, and also to advocate 
for more research in the area. Several 
consortiums in the United States have 
been established to develop a cure for 
HIV, with experts coming from fields 
such as immunology, genetics, virology and also
the private sector. Our knowledge of HIV
persistence under antiretroviral treatment has
progressed in past years. Strategies
being investigated include reactivating the
latent virus to flush it out of the cells and
then to kill the virus with immune agents
or a vaccine. Gene therapy to make cells
resistant to HIV infection is also being
explored.


For the first time, this year’s Lindau meeting 
boasts more female young researchers than 
male. How can more women be encouraged 
to take scientific posts? 

When I first started work in the 1970s at 
the Institut Pasteur in Paris, France, there 
were no more than five female professors; 
today, the same institution has close to 
50% female professors, which is wonder- 
ful. One way forward is to better recognize 
the work of women, although I think that 
this is already progressing. Another issue 
is children. I made the choice not to have 
children because I thought it was too difficult
at that time to have a career and a
family — although it might not be the best 
solution and many other women scientists 
do choose to have a family. Certainly we 
can better organize research institutions 
to offer childcare, for instance. While we 
all can agree that equity is a good thing, 
women shouldn't be selected just because 
they are women.


Iria Gomez-Touriño completed
her PhD in biology
at the University
of Santiago de
Compostela, Spain,
and is a Marie 
Curie postdoctoral 
fellow in the 
immunobiology department of King’s 
College London, where she focuses 
on identifying the T-cell receptors of 
autoreactive T cells in type 1 diabetes.

Q&A Michael Bishop
Free thinker 


Michael Bishop and Harold Varmus proved that genetic changes could drive the formation of 
tumours. They were awarded the 1989 Nobel prize in Physiology or Medicine for discovering 
the origin of retroviral oncogenes. Bishop — now director of the GW Hooper Foundation at the 
University of California, San Francisco — tells Kipp Weiskopf about 40 years in cancer research. 


What first drew you to science, and to 
biomedical research in particular? 

My first scientific hero was Arrowsmith — 
the main character in the 1925 novel of the 
same name by Sinclair Lewis, which almost 
every medical student of my generation read. 
It is about an idealistic young man who starts 
out as a family physician but is not satisfied 
and wants to be a medical scientist who cures 
diseases. I identified with him because I grew 
up in rural Pennsylvania wanting to be a 
doctor but I was not very sophisticated. When 
I went to medical school at Harvard in Boston, 
Massachusetts, I had never seen the inside of 
a research laboratory, so I immediately took 
up with classmates who had undergraduate 
research experience and I credit them with 
my decision to try research. 


What has been the most exciting stage of
your career?

I had a great time working on polio in my early
years in the lab. But I switched to retroviruses
just before the discovery of reverse transcriptase,
which was essential to the biotechnology
revolution. We found ourselves at the
cutting edge of an absolutely new field in which 
things were moving extremely rapidly. Every 
young scientist’s objective should be to start 
something new because that’s when things 
are really fun. If I were beginning my scien- 
tific career today I would study neuroscience, 
which has fascinated me ever since I encoun- 
tered it during my first year at Harvard Medical 
School and which still has thrilling frontiers. 


Has working in the San Francisco Bay Area 
been a particular influence? 

When I arrived in 1968 it was in the middle
of the Haight-Ashbury ‘hippie heaven’ era
[named after a district in San Francisco],
and a degree of openness also pervaded the
academic community. I had other offers from
institutes on the East Coast, but I disliked
their academic pyramid structures. So I went
to the University of California, San Francisco
(UCSF), which at the time was of no consequence
whatsoever. That did not bother me
in the least because I was working on a very
humble problem and having a wonderful
time. There was an atmosphere that made it
okay to explore any research direction. It was
also a lively political environment. I flirted
with the Peace and Freedom Party for a while,
and it was the time of the Free Speech movement.
There was an open spirit that I had
never quite encountered before.

A cervical cancer cell. Many cases of this form of the disease are caused by the human papillomavirus.


In more than 40 years of cancer research, what 
hits have we scored? 

Two success stories are slam dunks. First, 
recognition of the fundamental role of the 
genome in cancer has completely transformed 
the way we think about every aspect of cancer. 
Consider the issue of what causes cancer.
I view this as the most challenging
unsolved problem in cancer research.
Genome science may help solve this
problem, because the nature of the
damage in tumour DNA often represents the 
chemical signature of the causative agent. 
This is clearly seen in skin cancers caused by 
exposure to sunlight, and there are genomic 
clues for other cancers, such as breast cancer. 
Or consider early detection of cancer. It seems 
only a matter of time before either molecular 
cytology on excretions or circulating DNA 
helps us to detect stealth tumours, such as pancreatic
and ovarian cancer. And of course, the
implications for therapy are profound. 

The second big hit has been in public health 
— specifically, the substantial drop in lung 
cancer in the United States that is attributable
to the dramatic decline in smoking. Unfortunately,
we are not doing as well in some
other realms, such as obesity, or immuniza- 
tion against the papillomavirus, which causes 
cervical cancer. 


Will we find a cure for cancer? 

It seems unlikely to me that there will ever be 
a single cure for cancer. The disease is just too 
heterogeneous for that. Instead, I would like 
to emphasize that if we are ever going to con- 
quer this disease, it will be by prevention. For 
example, we can prevent numerous diseases 
by vaccination against their causes. Examples 
include polio, measles, hepatitis B and cervical 
cancer. We need to know the causes of cancer 
in order to prevent the disease. The fact that 
we have not eradicated lung cancer caused by 
smoking and that we have allowed the tobacco 
industry to continue to control the agenda is 
a public disgrace — but the United States has 
blazed the path and in California we are doing 
better on this front than most other places. 


Has a career spent working on cancer made 
you more or less fearful of the disease? 

Some things haven't changed. My wife has 
colon cancer and the lead drug for that dis- 
ease is the same one I was prescribing when I 
was a young physician 50 years ago, which is 
pretty sobering. So yes, it is a fearsome disease; 
even with therapy you may never have a truly 
comfortable day in your life again. By combin- 
ing our eventual understanding about every 
lesion in the cancer genome with the emerging 
prospects of immunotherapy, though, I think 
the future is pretty bright. 


Is the current relationship between academia 
and the pharmaceutical industry the best 
model for drug development? 

It is a bit like what Winston Churchill said
about democracy: it’s a terrible system
except for all of the others. We are in a mar- 
ket economy and we're going to stay that way 
because the development of drugs is very 
expensive. Some companies have shut down 
their research arms completely, relying on 
academia for new discoveries. The danger is 
that the money invested by pharmaceutical 
companies in academic research is very tar- 
geted, which could dilute the academic enter- 
prise by crowding out fundamental research. 


What do you see as the next frontiers in 
rational drug design? 

Ultimately, it lies in understanding the signal- 
ling pathways so well that we can feed a com- 
puter all the DNA sequence data and have it 
tell us what are the likely targets for therapy, 
and what potential for drug resistance lurks 
in the tumour. The frontier is bioinformatics 
that uses genomic data to design a regimen 
that is free of pitfalls. 


Twenty-five years after winning the Nobel 
prize, what inspired you to attend Lindau for 
the first time this year? 

I have always had a major calendar conflict 
at this time of year, but having met students 
who have been here and also having my 
colleague Elizabeth Blackburn recommend 
the experience, I decided I would give it a try. 
It is more substantive than I had anticipated 
and my experience with the young research- 
ers has been excellent. 


How did winning the Nobel prize change
your life?

The most important thing is that being 
awarded the Nobel prize has not changed 
the way I feel about myself. It also has not 
changed the way my colleagues think of me, 
and has not affected my bank account very 
much either! I do not see it as a burden, as 
some people have described it, because I do 
not take it too seriously. However, it was defi- 
nitely an asset while I was chancellor at UCSF 
because, rightly or wrongly, it said something 
to the general community about the quality of 
the institution. Of course, it has also made it 
possible to come to a place like Lindau, which 
is a plus (except for the jetlag).


Kipp Weiskopf is an 
MD/PhD student at 
Stanford University in 
California and works 
on the interaction 
between the immune 
system and cancer. He
has developed drugs
that target CD47 and
stimulate immune cells, particularly
macrophages, to recognize cancer cells 
as foreign and attack them.

Q&A Torsten Wiesel
Progress in sight 


Torsten Wiesel is president emeritus of Rockefeller University in New York City. He shared half of
the 1981 Nobel Prize in Physiology or Medicine with David Hubel for their discoveries concerning
information processing in the visual system. He tells Stefano Sandrone about his greatest scientific 
achievement and his vision of the future. 


What kind of student were you? 

I was rather mischievous and not particularly
focused on my studies. I was more interested 
in sport. When I turned 17, I became more 
serious about academia and began to evalu- 
ate myself more. It was then that I decided I 
would become a doctor. I read a lot and I met 
lots of different people. I was raised in the larg- 
est psychiatric hospital in Sweden, where my 
father was director and chief psychiatrist. This 
undoubtedly greatly influenced the develop- 
ment of my values and other aspects of my life. 


Why did you choose medicine? 

I went into medicine partly because of my 
upbringing in the hospital. Also, my eldest 
brother became schizophrenic in his early 
twenties and I wanted to better understand his 
condition. As a doctor I became quickly frus- 
trated with the lack of adequate treatment of 
mental illnesses, and returned to my professor 
in neuroscience who allowed me to work in 
his laboratory for a year. During that year, he
received an enquiry from Stephen Kuffler at
Johns Hopkins University in Baltimore, Mary- 
land, who was looking for a postdoc. And so 
it was by pure luck that I ended up working in 
one of the best labs in the world. This marked 
the beginning of my scientific career, although 
it also meant that I never completed my PhD. 


What was your relationship like with David 
Hubel, the other half of your scientific team? 
When I met David at Johns Hopkins I realized 
he was a very smart guy and we immediately 
recognized our shared interests. Though we 
were very different, we complemented one 
another. I called him my ‘scientific brother’
as we were not close friends outside science 
— our families did not interact and we did 
not go to the movies or that kind of thing. We 
usually carried out two experiments per week 
on Tuesdays and Thursdays, often working 
through the night, then the next day we would 
analyse the data and plan the next experiment. 
It was brilliant how this worked for 20 years.

Were you aware of the importance of your
research into the visual system? 

We never talked about it. People told me it was 
important and my response was: the longer 
the research takes, the better it is. There was 
a lot of work to be done and although I was 
aware that people got the Nobel prize for such 
research and then went on the lecture circuit, 
I wanted to continue in the lab. I believe that 
if you decide to do something then you put 
your whole heart and energy into it. Had my 
science not worked out, I would have gone 
back to Sweden to be a doctor. Certainly, in 
terms of discovery, I got the most satisfaction 
from our studies of how the visual cortex is 
able to encode the orientation properties of 
an object. 


How different is the external ‘real’ world 
from what we see? 

The external world can be very different to 
our perception of it, depending on what our 
senses tell us. Some insects can see in differ- 
ent ways and their world is very different from 
ours. Because the basic wiring is the same in 
all humans, we can agree on certain things like 
colours and textures. But it is also clear that 
some people are better at certain things than 
others, such as mathematics, painting or writ- 
ing. This is related to high-level functioning 
of the brain. However, we do not even under- 
stand the basic circuitry behind auditory per- 
ception, such as how we hear music or voices. 


Will we ever fully understand the brain? 
Someone asked me this question after my 
speech at the Nobel dinner, and I replied: 
“Never, I hope.” Although understanding
the brain will be beneficial to helping solve 
problems associated with ageing, for example, 
I worry what might happen if governments 
get access to all the tricks. There are lessons 
to be learned from the atomic age here. There 
are things about which we always have to be 
vigorous and defensive. 


What will be the next paradigm shift in 
neuroscience? 

There are so many problems ranging from 
cells to circuitries that it is difficult to predict. 
In my area of competence, neurophysiology, 
we still need to understand the mechanisms of 
hearing and the circuitry of higher functions 
that allow us to recognize objects. I would like 
to know how the auditory system, with rela- 
tively few fibres, analyses information com- 
ing into the brain. We have such wonderful 
abilities to recognize voices as well as faces, yet 
we have no idea about how the brain and the 
auditory cortex make this possible. In general, 
we do not yet know how the brain is wired. 
In the 1960s and 1970s there was a big effort 
in artificial intelligence and a lot of resources 
invested, but it was pretty much a fiasco. The 
time was not right for that then, but the simultaneous
launch of the BRAIN [Brain Research
through Advancing Innovative Neurotechnologies]
Initiative, announced by
President Obama in 2013, and the Human 
Brain Project in Europe, also announced in 
2013, might be more timely. 


How does Sweden, home to the Nobel 
prize, treat its laureates? 

The prize is most revered in Asian coun- 
tries. If you have a Nobel prize and you 
visit China or Japan you are received 
as if you were a king. In Sweden less so, 
because the mentality is that we should 
all be treated as equals. A friend of mine 
once requested a table by the window 
when making a reservation at a restaurant 
to celebrate my birthday and mentioned 
that I was a laureate, only to be told that 
it made no difference. And you don’t get
better seats in the theatre, either. Here 
in Lindau it is different, of course. But I 
would like to see more people giving talks 
here, even if they are not recipients of the 
prize, because it shouldn't be an institution 
for ageing scientists. You want students to 
be exposed to the best there is. 


What tips would you give to a young 
scientist today? 

Science should be fun: you should enjoy 
what you do. In this era of ‘big science’, there
are still areas in neuroscience where an 
individual or small laboratory can make an 
important contribution, such as the study 
of the sensory and motor systems and the 
cortical circuitry underpinning the higher 
function of recognition of objects and 
places. My advice for an undecided bril- 
liant young person looking for an area of 
research is to enter the field with the sincere 
intention of helping to solve the intriguing 
questions of how the brain works. 


What is the most important lesson you 
have learnt? 

To respect other people’s point of view, 
even if you disagree. Lots of discoveries in 
science have been met with claims that they 
must be wrong, but it is a mistake to say 
that on the grounds that something doesn't 
agree with dogma. I have a deep sense 
of respect for everybody. From a janitor to 
a president, I deal with each person in the 
same way.


Stefano Sandrone
is a PhD student
at King’s College
London. He studies
neuroplasticity
and connectional
neuroanatomy,
and has a special 
interest in the history of neuroscience.

Q&A Brian Kobilka
Stuck on structure 


Brian Kobilka shared the 2012 Nobel Prize in Chemistry with Robert Lefkowitz for their studies 
of G protein-coupled receptors. He is professor of molecular and cellular physiology at the 
Stanford University School of Medicine in California. Haya Jamal Azouz asks Kobilka what it 
takes to spend 30 years answering a single research question. 


What are G protein-coupled receptors 
(GPCRs) and why are they interesting? 

GPCRs are proteins found on the surface of all 
cells in the body that recognize and bind hor- 
mones and neurotransmitters. Their principal 
purpose is to transmit a signal to active pro- 
teins on the inside of the cell, thereby chang- 
ing the cell’s behaviour. There are more than 
800 GPCRs in the human genome. They 
mediate the majority of the body’s response 
to hormones and neurotransmitters, and are 
responsible for the senses of sight, smell and 
taste. GPCRs are involved in so many aspects
of normal physiology, including homeostasis.
It is interesting to understand how protein 
structures mediate signalling behaviours; 
understanding the structures may be helpful in 
developing more selective and effective drugs 
for these receptors, which represent approxi- 
mately 30% of current drug targets. My initial 
interest in β-adrenergic receptors came from
my clinical experience using β-agonists
to treat asthma and β-blockers to treat
heart disease.

Why is the structure of GPCRs so hard to crack?
To determine the structure of proteins such
as GPCRs it is necessary to crystallize the protein.
The diffraction patterns of X-rays that
pass through the crystals can then be used to 
determine the crystals’ 3D structure. The first 
GPCR structure to be solved — rhodopsin, 
which is a protein in the rods of the retina that
can respond to a single photon of light — was 
an incredible challenge. Even though rhodop- 
sin is abundant and is one of the most bio- 
chemically stable GPCRs, it has relatively little 
polar surface area, which makes it difficult 
to form crystals. Solving the structure of the 
β-receptor, a different GPCR that is activated
by the hormone adrenaline, was even more
challenging. Unlike rhodopsin, there is no tissue
in which the β-receptor is expressed at high
levels so we had to use cultured cells to produce
the receptor. The β-receptor is flexible and biochemically
unstable and it is difficult to obtain
enough protein to allow crystallography trials. 


Did you expect the project to be so tough? 
No! When we set out in the early 1990s, 
we didn’t know the first thing about
crystallography or about the biochemical
behaviour of these proteins, for example 
whether they were dynamic or unstable. Using 
a technique called fluorescence spectroscopy 
we were able to get structural information that 
provided insight into why it was so difficult 
to crystallize the β-receptors. We learned that
the β-receptor did not operate as a simple two-state
on-off system, but that its shape was
complex and flexible. [...] same shape, but our
fluorescence studies
suggested that the β-receptor did not exist in
a single conformation even when bound to an 
antagonist or agonist. A population of recep- 
tors in solution have different shapes — subtle 
differences, but sufficiently large to prevent 
crystal formation. 


What breakthrough allowed you to determine 
the structure of the β₂ adrenergic receptor?
We finally obtained our first crystals in 2004, 
but they were too small to be analysed using 
conventional X-ray sources. I showed pictures 
of the crystals to Gebhard Schertler, who at 
the time was helping to develop a microfocus 
X-ray beamline at the European Synchro- 
tron Radiation Facility (ESRF) in Grenoble, 
France. We saw the first diffraction patterns 
at the ESRF in July 2005, confirming that we 
had a protein crystal. The quality was too poor 
to determine the 3D structure but the result 
gave us hope that we could improve the quality 
of the crystals. My wife joined me for the first 
experiment at the ESRF that July, and she was 
the first to see a diffraction pattern, confirming 
that we had a protein crystal. 

Until then I had felt that the project might 
fail, so I didn’t think it was suitable for a student 
or postdoctoral researcher to work on. After- 
wards I recruited two very talented postdocs to 
join the effort and they succeeded in determining
structures of the β₂ adrenergic receptor in
2007 with the help of Stanford colleagues and 
collaborators from other universities. 


How has your wife contributed to your success? 
She has been extremely supportive and 
although she is not a trained biochemist she is 
very good at finding ways to make the research 
process more efficient. We met in our first 
biology class in college and we have worked 
together ever since, so she understands what 
I do and does not ask why I spend so much 
time in the lab. 


Were you driven by fear of another group 
discovering the GPCR structure first? 

I had always hoped that someone would get 
the result, but of course we wanted to be first.
We knew there were other groups working on
similar projects and there were often rumours 
that one group or another had crystals. Even 
as recently as spring 2007, while we were 
working to obtain the final data for our two 
structures, there was a detailed rumour that a 
group in France had the β₂ structure and that
a paper had been submitted. That turned out 
to be false, but it was fortunate for us because it 
prompted a friend at a Danish pharmaceutical 
company to donate US$100,000 to our project 
at a time when things were tight financially. 


Did you ever imagine that you might win a
Nobel, and what effect it would have? 

The first time I really became aware of the 
prize was in the 1990s when I visited Stock- 
holm while on vacation with my family. We 
visited the city hall where the ceremony is held 
and our tour guide described the ceremony. I 
thought about how exciting it would be, but it 
never occurred to me that I might win it until 
2012, when I found out I'd been chosen. That 
first year was very disruptive, in part because I 
accepted too many invitations to speak at con- 
ferences and visit universities, often overseas. 
The volume of e-mail also increased dramati- 
cally and as a result I wasn’t spending enough 
time focusing on my research. 


Will you continue working in this field? 

Yes. There are plenty of challenges ahead in the 
GPCR field. A crystal structure only gives us 
a snapshot of the protein in a single state, but 
these proteins are in constant motion between 
different states. The role that dynamic behav- 
iour plays in receptor function is of great 
interest to membrane-protein structural 
biologists, biochemists, pharmacologists and 
pharmaceutical-company scientists. There is a 
lot more work required before we understand 
how receptors signal to G proteins and other 
cell-signalling and regulatory proteins such as 
kinases and arrestins. We also know very little 
about how receptors work in their native envi- 
ronment: the plasma membrane of living cells. 
Developing methods to study receptor struc- 
ture and dynamics in living cells may be even 
more challenging than crystallographic stud- 
ies. It will help us to understand the versatile 
signalling behaviour of GPCRs at a molecular 
level. By versatile, I mean that one receptor may 
signal through different intracellular signal- 
ling proteins. A better understanding of this 
behaviour may help us to develop more effective
drugs.


Haya Jamal Azouz is 
a medical student at 
Alfaisal University in 
Riyadh, Saudi Arabia, 
where she investigates 
novel approaches to 
cancer therapy.

Lorna Stewart (far left) quizzes young researchers John Lee, Claudine Gauthier and Alina Solomon (far right) about what they think happens as our bodies age.


GERONTOLOGY 


Will you still need me, will 
you still feed me? 


As the Lindau Nobel Laureate Meetings turn 64, laureates and young researchers discuss 
growing old — and whether exercise and stress reduction can slow the ageing process. 


BY LORNA STEWART 


“What is the life expectancy of the
world population today?” asks
Hans Rosling, a global-health
researcher at the Karolinska Institute in Stockholm,
during the opening ceremony of this
year’s Lindau Nobel Laureate Meeting. The 
700-strong audience of young researchers and 
Nobel laureates reach for their keypads. “Is it 
50, 60 or 70 years old?” he continues. The audi- 
ence casts its vote. The correct answer, 70, gets 
the fewest hits. 

“Even chimps do better than that,” jokes
Rosling, hinting that the audience would 
have got closer to the correct value had they 
answered at random. But the serious point he 
is making is that our notions of global demographics
are outdated. And scientists need to
know the facts if they are to set priorities for
future medical research. Global life expec- 
tancy has risen dramatically during the past 
century, raising profound issues concerning 
the role of medical practice and the demands 
on scientific research. 

The science and ethics of ageing was a theme 
at the meeting, and also the focus of a series 
of discussions, captured by the Nature Video 
team (see www.nature.com/lindau/2014). 
During those conversations I kept returning 
to one question: should we concentrate efforts 
on treating conditions that affect us in old age 
or devote resources towards earlier stages
in life, when exercise or stress reduction
could have greater long-term benefits?

NATURE.COM
To watch Nature Video’s four films made at Lindau see: go.nature.com/uzypa2

At Lindau, I discussed this issue with three
young researchers and two Nobel laureates, 
and since then I have also put the question to 
other researchers in the field of ageing. 

Ageing is linked to a multitude of biological 
processes, but scientists know surprisingly little
about why, and how, we age and die. “It’s a
large and complicated business, the biology of
ageing,” says Thomas Kirkwood, who is associate
dean for ageing at Newcastle University,
UK. “We age because it was never a prior- 
ity for our genomes to invest in the kind of 
maintenance and repair that could keep you 
going very much longer — or hypothetically 
forever,” he adds.

To date, hundreds of genes connected to 
ageing and longevity have been identified, 
but there is no master switch. Instead, most 
of these genes perform functions that help to 
maintain cells, such as repairing damage to
DNA or regulating antioxidant levels. 

Individually, genes have a relatively small 
impact on lifespan, but together they account 
for 25% of our longevity, Kirkwood says. That 
means that one-quarter of your chance of liv- 
ing into old age comes from your parents, he 
explains, with the remainder left to chance 
and environmental factors. “We don’t know 
yet exactly how the remaining 75% breaks 
down, but I wouldn't be surprised if it turns 
out that as much as half of that is influenced 
by things like exercise and healthy nutrition,”
he says. 
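
Kirkwood’s rough split can be laid out as simple arithmetic. The short Python sketch below uses only the figures quoted above; the variable names are illustrative, and the ‘half’ is his speculative upper estimate rather than a measured value.

```python
# Back-of-envelope split of the influences on longevity, using the
# figures quoted by Kirkwood. Variable names are illustrative only.
genetic_share = 0.25                        # attributed to inherited genes
remaining_share = 1.0 - genetic_share       # chance and environment: 0.75

# Speculative upper estimate: as much as half of the remainder may be
# influenced by things like exercise and healthy nutrition.
lifestyle_share_max = 0.5 * remaining_share
other_share_min = remaining_share - lifestyle_share_max

for label, share in [
    ("genes", genetic_share),
    ("lifestyle (upper estimate)", lifestyle_share_max),
    ("chance and other factors", other_share_min),
]:
    print(f"{label}: {share:.1%}")
```

On these assumptions, lifestyle could account for up to 37.5% of longevity, which is more than the share attributed to genes.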

The difficulty in ageing research lies in distinguishing
the physiological and psychological
changes that are attributable to an underlying
ageing process from those that are caused by
age-related diseases. In the hunt for the recipe
for long life, scientists have frequently turned
to individuals and populations who show exceptional
longevity. Earlier this year, researchers
arrived at a fresh perspective on the biology of ageing
when they analysed DNA isolated from tissues
obtained during the
autopsy of a Dutch woman named Hendrikje
van Andel-Schipper, who had lived disease-free
until the ripe old age of 115.

Studying van Andel-Schipper’s body after 
her death in 2005, Henne Holstege, a geneti- 
cist at the VU University Medical Center in 
Amsterdam, the Netherlands, and her co- 
workers concluded that stem cells hold the key 
to understanding the limits of an individual's 
lifespan. They found that, by the end of her life, 
the majority of van Andel-Schipper’s white 
blood cells had come from just two stem cells. 
At birth, humans have 20,000 stem cells; it is 
not unusual for someone in old age to have so 
few remaining stem cells, but scientists had 
been uncertain whether it was old age or disease
that caused this loss. Van Andel-Schipper
had been particularly healthy, so they proposed 
that it was the ageing process that had caused 
the reduction in her stem-cell count. Mouse 
studies have found that stem cells decrease
in number steadily throughout the mouse’s 
lifespan — researchers suspect that this is also 
the case in humans. The chromosomes in van 
Andel-Schipper’s two remaining blood stem 
cells had much shorter telomeres (caps at
the ends that protect the chromosomes from
deterioration) than those found in other cells.
They suggested that her stem cells had reached 
the end of their ability to keep replenishing. 

Each time a cell replicates, its telomeres 
shorten. When telomeres are too short, the 
cell will either stop replicating and become 
senescent or it will die. If a cell with short- 
ened telomeres continues to replicate it can 
become abnormal. Exactly why some people's
telomeres shorten more slowly than other
people’s is not fully understood, but clues are 
emerging. 
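
The replication limit described above can be illustrated with a toy model in Python. This is a minimal sketch, assuming a fixed loss per division and a single critical length; the function name and all numbers are hypothetical and are not measurements from the studies discussed.

```python
def divisions_until_senescence(telomere_length, loss_per_division, critical_length):
    """Count how many times a cell can divide before its telomeres
    would fall below the critical length (toy model, arbitrary units)."""
    if loss_per_division <= 0:
        raise ValueError("loss_per_division must be positive")
    divisions = 0
    # Keep dividing only while the next division leaves the telomere
    # at or above the critical length.
    while telomere_length - loss_per_division >= critical_length:
        telomere_length -= loss_per_division
        divisions += 1
    return divisions


# Hypothetical comparison: halving the loss per division doubles the
# number of divisions available before the cell must stop.
fast_shortening = divisions_until_senescence(10_000, 100, 5_000)
slow_shortening = divisions_until_senescence(10_000, 50, 5_000)
print(fast_shortening, slow_shortening)
```

In this simplified picture, anything that slows the loss per division extends a cell line's replicative life, which is why differences in shortening rate between people matter.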

Elizabeth Blackburn, who won the 2009 
Nobel Prize in Physiology or Medicine for her 
work on telomeres, is taking steps to keep hers 
long. She says that the key is to avoid getting 
stressed. Since uncovering the link between 
stress and telomeres, Blackburn has taken
up exercise and meditation, and at Lindau she 
encouraged me to do the same. Her view is that 
focusing on medical and lifestyle interventions 
when you’re young benefits not just the individual:
families will have more time to spend
with their loved ones, too.


MARATHON TASK 

Alongside the mechanisms of biological 
ageing, researchers are also interested in con- 
ditions that are related to growing older, such 
as Alzheimer’s disease, cardiovascular diseases, 
diabetes and cancer. Such diseases are becom- 
ing more prevalent as people live longer, and 
understanding and treating them is the focus 
for some of the young researchers who took 
part in the Nature Video discussion. 

Alina Solomon, a neurologist at the Karo- 
linska Institute, works with people who have 
dementia. She sees commonalities across 
diseases of old age. “Several of these non- 
communicable diseases at older ages have 
common risk factors, so if we address them we 
can address several of these problems at the 
same time,” she says. She thinks that the best
approach for biomedical sciences is to focus on 
helping us live healthier, not just longer, lives. 
“We should consider a balance between adding
years to life and adding life to years,” she
explains. 

Solomon's view is shared by Oliver Smithies, 
joint winner of the 2007 Nobel Prize in Physi- 
ology or Medicine for his work on embryonic 
stem cells. He says that older people should not 
be the priority for medical science. At the age 
of 89 and still working in his laboratory at the 
University of North Carolina in Chapel Hill 
every day, not to mention piloting light air- 
craft in his spare time, Smithies is well placed 
to comment. “We have to be realistic about it,”
he says, but notes that facing facts is where the 
problems start. “We are sentimental and we say 
everybody has a right to life, which is true, but 
we can't afford to preserve every life. Why live 
to be 80 with aches and pains?” 

Claudine Gauthier, a postdoc working on 
blood-vessel ageing at the Max Planck Insti- 
tute for Human Cognitive and Brain Sciences 
in Leipzig, Germany, also thinks that there 
are good reasons to focus medical science on 
a younger cohort: people aged 40–50 years.
She sees middle age as an inflection point
in the ageing trajectory, a period when a body 
might be particularly sensitive to interven- 
tion. “If you look at any health parameter, the 
variance of it increases dramatically once you 
get to middle age,” she says. This means that 
interventions or lifestyle changes might have
a bigger impact here than at any other age. 
“Maybe the way to be healthy when you're 30 is 
not the same way to be healthy when you're 50.”

It comes down to prevention, she adds. 
“If you want to tackle ageing you've got 
to do it in a younger population because 
I don't think it’s sustainable in the long 
term to just cure every disease.” Blackburn 
agrees. “We can’t think of them as diseases 
of ageing,” she says. “Cancer unfolds silently, 
often for years, and then you say: ‘I got 
cancer.’ No, you didn't ‘get’ cancer, that’s a
process that’s been going on for ages.” 


LIVE HEALTHIER FOR LONGER 

‘Health-span’ is a phrase that came up a lot at
the meeting. The idea is to focus on the num- 
ber of years that you remain healthy and active, 
rather than on the number of years that you 
live. Many people I spoke to said that the focus 
for biomedical science should be on extending 
good health, not just on extending life. But are 
living longer and being healthy really at odds 
with one another? 

It depends on how you view health, says 
Kirkwood. A large-scale survey of people
over 85 years of age in Newcastle, UK, showed 
that most have multiple health problems but 
still regard themselves as in good or excellent 
health when comparing themselves to their 
contemporaries. “People have this notion 
that they will be bundles of misery suffering 
all kinds of illness and woe,” he says. “What 
we found was very far from the case. A large 
number of people were living very active, full 
and busy lives.” Perhaps, then, part of ageing 
healthily is about adjusting what we expect to 
be able to do. The good news, says Kirkwood, 
is that there is nothing in our bodies to programme
our death. “Our bodies are designed
for survival, they’re just not built well enough 
to survive indefinitely.” 

John Lee, a PhD student at Drexel University 
College of Medicine in Philadelphia, Pennsyl- 
vania, understands this problem. He wants 
to live to 150, but thinks that it is more likely 
that his grandchildren will achieve this feat, 
rather than him. He is working on developing 
exoskeletons to help people who have had a 
spinal-cord injury, and believes that techno- 
logical solutions may ultimately fix our crum- 
bling bodies and help us to age better. “We 
don't expect to be running marathons at 150,” 
he says. But, with this kind of help, we could be 
over 100 and still doing things “as if we were 30 
again, or maybe 50”.


Lorna Stewart is a freelance writer and radio 
producer based in London, UK. 
STOP THE MICROBIAL CHATTER


Bacteria can coat everything from thermal springs to teeth. Researchers are looking for 
antibiotics that can subvert the signalling that the microbes use to carve their niche. 
Sheets of communicating bacteria, or biofilms, are a common sight in the run-off channels from hot springs in Yellowstone National Park.


BY VIVIEN MARX 


Bacteria are continually evolving ways
to avoid the effects of antibiotics, and
with the pipeline of new drugs drying
up, infections are becoming more and more
difficult to fight. As the need for innovative
solutions grows, some microbiologists are
teaming up with chemists and engineers
to try to find ways to subvert the microbes 
by interfering with the signals they use to 
communicate. 

To undermine the microbes’ language, 
scientists first need to work out what they are 
saying. Bacteria use chemical signals to syn- 
chronize behaviour across a population. That 
behaviour can help us (in the digestion of
food, say) but it can also kill us.

Such molecular coordination is thought to 
be central to the formation of biofilms — slimy 
mats of bacteria that spread across surfaces 
such as hospital catheters or water filtration sys- 
tems. Some of the bacteria in a biofilm suspend 
their metabolism, explains microbiologist 
Peter Greenberg of the University of Washington
in Seattle, making antibiotics less effective
because they tend to target bacteria that are
still growing. The bacteria can also cover them- 
selves in an armour made of polysaccharides 
and proteins that antibiotics find difficult to 
penetrate, says microbiologist Bonnie Bassler 
of Princeton University in New Jersey. 

Such resistance to antibiotics can be 
treacherous, especially for people who have 
conditions such as cystic fibrosis that lead to 
long-term infections. Repeated treatment with
broad-spectrum antibiotics heightens the risk
that the bacteria will become resistant. 

Bacterial communication was first studied in 
the 1960s, and not long afterwards, research- 
ers found that a marine bacterium known as 
Vibrio fischeri would start to shine brightly once 
its population reached a certain density. The
finding that bacteria will turn their light on syn- 
chronously under certain conditions suddenly 
rendered bacterial behaviour visible and meas- 
urable, says Bassler. But because most scientists 
believed that bacteria were incapable of “fancy 
things” such as signalling, she says, the collective 
behaviour was generally dismissed as a “goofy 
phenomenon of bacteria living in the ocean”. 

Since then, researchers have observed this 
‘quorum-sensing’ behaviour in many species
and have started to decipher the biochemistry 
and genetics of how it happens. They have also
been developing devices with which to char- 
acterize the messages that are transmitted and 
received. 

In general, quorum sensing is triggered 
when signalling molecules emitted by individ- 
ual bacteria pass a certain threshold, at which 
point the molecules bind to receptors on the 
bacteria and cause the entire population to 
express specific genes at the same time. In the 
case of pathogenic bacteria, the synchronized 
behaviour can include the release of molecules 
known as virulence factors, which help bacteria
to colonize and harm their host. It also allows 
bacteria to create biofilms. As the organisms 
adhere to a surface, they keep signalling to one 
another. Once they sense a quorum, genes are
upregulated and sticky exopolysaccharides are
produced that ‘glue’ the bacteria together.

SLUDGE FIGHT
Bacteria can be used to prevent biofilms from clogging the filtration membranes used in wastewater treatment.
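
The threshold behaviour of quorum sensing can be sketched in a few lines of Python. This is a simplified illustration, not a biochemical model: it assumes that each cell emits a fixed amount of signal into a shared volume, that no signal is degraded and that the population doubles each generation. The function names and numbers are hypothetical.

```python
def quorum_reached(cell_count, signal_per_cell, threshold):
    """Return True once the pooled signal meets or exceeds the threshold."""
    return cell_count * signal_per_cell >= threshold


def first_quorum_generation(start_cells, signal_per_cell, threshold,
                            max_generations=100):
    """Double the population each generation and return the first
    generation at which a quorum is sensed, or None if it never is."""
    cells = start_cells
    for generation in range(max_generations + 1):
        if quorum_reached(cells, signal_per_cell, threshold):
            return generation
        cells *= 2
    return None


# Hypothetical example: one founder cell, one unit of signal per cell,
# and a threshold of 1,000 units.
generation = first_quorum_generation(1, 1.0, 1000.0)
print(generation)
```

Below the threshold nothing happens; once it is crossed, every cell responds at the same time, which is why the light of Vibrio fischeri switches on synchronously rather than gradually.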

These findings initially led to excitement 
about the possibility of blocking infection by 
inhibiting bacterial communication. But the 
enthusiasm quickly waned when potential 
drugs failed in early-stage testing. 

Now, scientists are taking a more sophisticated 
approach. The problem with the early work 
turned out to be in the assumption that the com- 
munication required only a few molecules, says 
Herman Sintim, a chemical biologist at the Uni- 
versity of Maryland in College Park. The reality 
is much more complex, he says. “In human cul- 
tures, we all know that it does not take just one 
word to silence a crowd and so we should not 
expect that from our distant cousins, bacteria.” 

It has taken some time, but the research 
community in this field has grown and research- 
ers have finally amassed enough knowledge 
about bacterial behaviour to start exploring 
how to stop the organisms from talking. “We 
are now getting there,” says Bassler. Academ- 
ics and companies are looking at fresh ways to 
study bacterial chatter and to create potential 
communication-disrupting drugs and agents 
for industrial and agricultural applications. 


THE LANGUAGE OF BACTERIA 

In developing drug candidates, researchers are 
sharpening their attack on infections beyond 
the broad-spectrum antibiotics currently in 
use. We need to talk to a specific bacterium “in 
a language only it understands”, says Martin
Blaser, director of the Human Microbiome Pro- 
gram at New York University Langone Medical 
Center. Narrow-spectrum antibiotics are less 
likely to engender resistance because they put 
fewer species under selection pressure. They 
also cause less disruption to the body’s com- 
munity of microbes — its microbiome. Broad- 
spectrum antibiotics will also remain necessary, 
especially for people who are very ill. In general, 
they are assumed not to have lingering effects, 
but Blaser says that “there's more and more evi- 
dence that’s just not true”. They could even wipe 
out microbial communities involved in the
developing metabolism of infants and children.

As signalling molecules build up, the bacteria
form a biofilm and clog the membrane.

Beads containing signal-digesting bacteria
help to prevent biofilms.

Infection-causing bacteria (red) are often buried
deep in tissue and surrounded by white blood
cells (blue), making them difficult to target.

It might take some time, but research on 
bacterial communication will “without ques- 
tion” deliver therapeutic opportunities, says 
Ronald Farquhar, who directs research at Cubist 
Pharmaceuticals in Lexington, Massachusetts. 
Regulatory agencies are particularly open to 
drug-firm suggestions that will meet the needs 
of people with chronic infections, he says. For 
example, someone who needs to use a urinary 
catheter for a long period of time could take a 
low-dose agent to stop bacteria from forming a 
biofilm on the device. 

Some drug candidates have already been 
identified. Microbiologists David Davies and 
Claudia Marques from Binghamton Univer- 
sity in New York, for example, have found a 
chemical that some bacteria make to address 
overcrowding. The bacteria continuously produce
cis-2-decenoic acid, a communication
molecule. When the molecule reaches a criti- 
cal threshold in a biofilm, a cascade of events is 
triggered, including changes in gene expression, 
prompting the bacteria to release themselves 
from the biofilm and disperse. Davies is now 
starting a company to commercialize a synthetic 
version of the acid for treating acne and disinfecting
wounds.

But Greenberg, among others, thinks that 
caution is in order before moving potential 
therapies towards the clinic. Dispersing a bio- 
film could end an infection, he says, but it might 
also distribute it. “You might be making more 
trouble than you had to start with,” he says. 

Indeed, bacterial communication reveals ever 
more complexities. Greenberg has found, for example,
that some bacteria in a community are cheats: 
they do not join the others in secreting enzymes 
in response to quorum-sensing signals, but still 
share in the benefits. “There are mixtures of 
cheats and cooperators in our laboratory experiments,”
Greenberg says, and a similar mix might
be present in the infected lungs of a person 
with cystic fibrosis. Potential drugs could well 
be stymied by those cheats. Before developing 
therapies that disrupt communication, scientists 
need to know much more about quorum sens- 
ing and other bacterial behaviour, he says. 

Another complication is crosstalk between 
species and even across kingdoms. For 
example, Vanessa Sperandio, who studies 
bacterial communication at the University 
of Texas Southwestern in Dallas, has found 
that the stress hormones adrenaline and 
noradrenaline, which are present in the gut 
and elsewhere in the body, can amplify bac- 
terial signalling and increase the virulence of 
Escherichia coli O157:H7 (ref. 7), a pathogen 
that causes bloody diarrhoea and can be fatal. 


OTHER APPLICATIONS 

To better understand the complexities of
bacterial communication and how to use them
against disease, the field is also turning to theoretical
work, such as computational modelling
and simulation, and to experiments with bacterial
pathogens of plants. Greenberg and Lianhui
Zhang, at the A*STAR Institute of Molecular and
Cell Biology in Singapore, are working on a project
funded by the Chinese government to use
quorum-sensing inhibitors on crop pathogens.
Such experiments could be proof-of-principle
for biomedical applications, Greenberg says.

Quorum-sensing inhibitors could well
make it to market in agriculture before biomedicine,
says Paul Williams, a chemical
biologist and pharmacologist at the University
of Nottingham, UK, a hub for bacterial-communication
research. Scientists and companies
are also testing communication inhibitors
for industrial applications. For example, microorganisms
are being used in bioreactors to
degrade the pollutants in wastewater. The water
is then passed through a filter, but a build-up of
bacteria can clog the pores of the membrane.
The reactor then has to be taken offline, flushed
out and cleaned with harsh chemicals such as
chlorine, an energy-intensive process that
incurs more than half the cost of running a
membrane bioreactor, says Chung-Hak Lee, a
chemical engineer at Seoul National University.

The marine bacterium Vibrio fischeri glows brightly when it reaches a certain cell density, or quorum.

Lee has come up with a potential solution. 
His approach taps into a typical communi- 
cation network found in biofilms, in which 
enzymes secreted by some species digest sig- 
nalling molecules emitted by others. He and 
his team isolated such signal-quenching bacte- 
ria and placed them in beads that contain pores 
that keep the bacteria in, but let signalling 
molecules pass through. When placed near 
the filtration membrane in a bioreactor, the 
beads undermine bacterial communication 
and help to stop biofilms from forming (see 
‘Sludge fight’). In lab tests and in a pilot-scale 
wastewater treatment plant, Lee has found that 
the beads save almost half of the energy costs of 
a conventional membrane bioreactor. 

Several companies are exploring how to 
prevent biofilms for industrial and biomedi- 
cal applications. Selenium, a spin-off com- 
pany from Texas Tech University in Austin 
that is backed by the venture-capital firm 
Emergent Technologies, is developing sele- 
nium-containing coatings that could protect 
materials such as catheters, contact lenses and 
voice prostheses by producing reactive oxy- 
gen molecules that ward off bacteria. 

Another company, Curza, founded last year 
in Salt Lake City, Utah, is developing coatings 
that prevent biofilms from forming on hip and 
knee implants. Its research involves chemical 
synthesis, molecular genetics, mass spectrom- 
etry and scanning electron microscopy, as 
well as a proprietary flow-cell assay that better
represents physiological conditions by using 
liquid flow rather than stagnant broth assays. 
The company says that the assay can help to 
characterize whether a biofilm is prevented
under real-life-like conditions and show what 
might happen as an antimicrobial compound 
dilutes away from a medical device's coating, 
for example. 

And Kane Biotech of Winnipeg in Canada 
is developing combinations of antimicrobials 
and biofilm inhibitors for coating biomedi- 
cal devices, treating wounds and protecting 
teeth and skin. Sri Madhyastha, chief scien- 
tific officer, says that one of their products has 
been licensed by a medical-device company. 
Kane also sells products through veterinarians 
and distributors, including a water additive 
aimed at preventing plaque from forming on 
the teeth of pets. 

The company tried to obtain approval from 
the US Food and Drug Administration for an 
anti-biofilm enzyme in a wound-care product, 
but as a new chemical entity, it would require 
extensive testing. That route “is too expensive
and time-consuming”, Madhyastha says,
so the company has put this product on the 
back-burner. 


OBSERVATION PLATFORMS 

To test their potential products, Kane’s 
researchers use confocal microscopy and an 
instrument called the CDC Biofilm Reactor: 
a 1-litre beaker containing 8 slim rods around 
which liquid moves. Dotting the length of the 
rods are 24 circular disks on which biofilms 
can be grown and tested. The reactor was built 
under a licence from the US Centers for Dis- 
ease Control and Prevention by BioSurface 
Technologies of Bozeman, Montana, which 
sells several other types of vessel in which 
scientists can grow and disrupt biofilms in a 
controlled, standardized environment. 
At Fluxion Biosciences in South San
Francisco, California, cell biologist Bryan 
Haines helps labs to set up the firm’s BioFlux 
microfluidic platforms. The platforms allow 
scientists to do 24 biofilm experiments on one 
multiple-well plate. The temperature and gas 
content in the medium can be adjusted to suit 
the preferred growth conditions of the bacte- 
rium being studied. The wells are the reservoirs 
for reagents, potential antibiotics and bacteria; 
running underneath them are micrometre- 
scale channels in which a biofilm can grow. 
The plate is sealed at the top and users select 
the pressure with which to distribute fluids and 
cells through the channels, then observe the 
biofilm through an inverted microscope. 

But Sintim says that scientists need better 
assays if they are to study the subtleties of bac- 
terial communication. Cells live in a three- 
dimensional architecture and respond to many 
cues. And biofilms contain multiple species, 
making a specific biofilm hard to culture using 
traditional approaches. “Many systems that 
have been developed to date are reductionist 
systems,” Sintim says, “and it is not obvious to
me if data obtained from these reductionist
platforms have any biological meaning.”

Together with bioengineer William Bentley 
at his university, Sintim is developing a micro- 
fluidic system that will not just track cells mov- 
ing through a three-dimensional space, but will 
also let experimenters perturb conditions and 
measure changes in appearance and behav- 
iour. Their system uses a membrane to sepa- 
rate two types of bacteria. On one side of the 
membrane are bacteria they have engineered 
to fluoresce green under ultraviolet light. These 
bacteria secrete signalling molecules that can 
pass through the membrane. On the other 
side are bacteria engineered to fluoresce red 
only when they receive that signal. The device
allows researchers to alter the environment
of each side independently and to control the
rates of flow of liquids across the device (see
‘Just watch’). Scientists can then study the effect
of different gradients in a setting that is more
typical of, for example, the body.

[Figure, ‘Just watch’: green-fluorescing bacteria emit signalling molecules; labels: nutrients, membrane.]

Thomas Bjarnsholt wants assays that mimic the
way that bacteria can be shielded from antibiotics.

Thomas Bjarnsholt helps university-hos- 
pital physicians to diagnose infections and 
has a microbiology lab at the University of 
Copenhagen, where he is building a system for 
studying biofilms. Current assays do a poor 
job of showing how slowly a biofilm forms 
on a medical implant, he says, so he wants to 
develop an assay that more closely mimics 
the in vivo conditions. Also, only a few people 
develop infections when their hips or knees 
are replaced, so he hopes to determine what 
makes some luckier than others. 

JUST WATCH
Researchers at the University of Maryland are building a microfluidic device to
study bacterial signalling. A membrane separates two types of bacteria (one
fluoresces green and the other red) but allows the passage of signalling
molecules. The device allows scientists to change the flow rate of liquids to look
at concentration gradients, as well as to adjust various environmental factors,
then observe how that affects communication between the bacteria.
Bacteria fluoresce red when they receive signals from the ‘green’ bacteria.

In his view, a communication disrupter
should be tested not just by adding it to a
biofilm. In the chronically infected lung of a
person with cystic fibrosis, antibiotics have to 
travel through the bloodstream, then diffuse 
through necrotic material, mucus and pus to get 
to the infection site. “It’s all embedded in slime,”
he says. The slime also has anaerobic pockets, 
where antibiotics tend to fail. Just 40 micro- 
metres of pus or mucus suffice to create such 
pockets. He is developing surfaces, gels and 
other media that mimic this kind of shielding 
and allow researchers to take this into account. 

Quorum-sensing inhibitors and other com- 
munication disrupters will eventually emerge, 
Bjarnsholt predicts. An area of interest for
him is dressings, especially for people with 
diabetes, who repeatedly develop wounds. At 
the moment, dressings often contain silver, 
which acts as an antibacterial treatment, but 
infections still develop, so new approaches are 
needed, he says. 

But new antibiotics will need more-expen- 
sive tests that require greater expertise to 
administer, says Sperandio. The standard way 
to test antibiotics is the minimal inhibitory 
concentration test, which measures the con- 
centration at which a compound needs to be 
administered to stop bacteria from growing. 
Williams points out that this approach “is obvi- 
ously of no use” for assessing compounds that 
disrupt communication. 
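
Williams's objection is easy to see in a toy version of the test. The Python sketch below is a simplified illustration, assuming that each tested concentration is scored only as ‘growth’ or ‘no growth’; the function name, the dilution series and the results are all hypothetical.

```python
def minimal_inhibitory_concentration(growth_by_concentration):
    """Return the lowest tested concentration at which no growth was
    seen, or None if the bacteria grew at every concentration."""
    inhibiting = [conc for conc, grew in growth_by_concentration.items()
                  if not grew]
    return min(inhibiting) if inhibiting else None


# Hypothetical two-fold dilution series (arbitrary concentration units).
# True means the bacteria grew at that concentration.
conventional_antibiotic = {0.5: True, 1: True, 2: True, 4: False, 8: False}
communication_disrupter = {0.5: True, 1: True, 2: True, 4: True, 8: True}

print(minimal_inhibitory_concentration(conventional_antibiotic))
print(minimal_inhibitory_concentration(communication_disrupter))
```

A compound that silences signalling without stopping growth never produces a ‘no growth’ reading, so the test returns no value for it however well it blocks virulence or biofilm formation.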

In fact, says Sperandio, the whole communi- 
cations approach to curing infection is at odds 
with the long-held dogma that a cure means 
killing the microbes. Communication disrupters
could prevent pathogenesis without killing
the pathogen, for example. Except in rare cases, 
such as infections of heart valves, it is not nec- 
essary to kill every bacterium, says Blaser. Even 
conventional antibiotics do not sterilize an 
organ; they reduce replication rates and “ulti- 
mately it is the immune response in patients 
that clears the infection”, he says. New antibiot-
ics could battle bacteria in this way, too. 

The war on harmful bacteria is most defi- 
nitely a war that humans need to win, says 
Blaser. But that does not mean we have to 
harm ourselves in the process. “We don’t want
a Pyrrhic victory,” he says.


Vivien Marx is technology editor for Nature 
and Nature Methods. 
Directors: John Tooze launched The EMBO Journal; Frank Gannon lobbied for better funding; Maria Leptin is forging new alliances.


Fifty years of EMBO 


Georgina Ferry reflects on the evolution of the European Molecular Biology 
Organization, founded to help Europe to compete with the United States. 


It began during the Cuban Missile
Crisis in 1962. The nuclear physicist
Leo Szilard went to Geneva in Switzerland
“because he thought America
was going to be bombed”, recalls Sydney
Brenner, a founding member of the European
Molecular Biology Organization (EMBO).
There, Szilard met Victor Weisskopf, the
head of CERN, Europe’s particle-physics
lab. “They wanted to found CERB, Centre
Européenne de Recherche Biologique,” says
Brenner. “Nuclear physics and molecular
biology would go together.”

That catalysing moment gave rise to an
organization that, taking cues from the
Rockefeller Foundation and the Cold Spring
Harbor Laboratory in the United States, has
acted as a matchmaker, educator, benevolent
godparent and advocate for Europe’s
life scientists. EMBO’s elected membership
has included 79 Nobel prizewinners, and its
fellowship schemes have supported thousands
of young researchers.

EMBO owes its origin and evolution to
the enduring challenge of making European
scientists better connected and thence
more competitive. How, as it celebrates its
half-century, is EMBO remodelling itself for
the very different landscape of twenty-first-century
life sciences?

In the late 1950s and early 1960s, ambitious
molecular biologists were leaving
Europe for the United States. In 1958,
Jacques Monod, part of a powerful nucleus
of molecular biology at the Pasteur Institute 
in Paris who went on to win a Nobel prize, 
warned that the new discipline was forg- 
ing ahead on the other side of the Atlantic 
because the structure of European univer- 
sities put up barriers between disciplines, 
institutions and countries. 

Monod’s proposal for a European insti- 
tute in Paris went unfunded. In Italy, the 
geneticist Adriano Buzzati-Traverso was 
more successful. He established the Inter- 
national Laboratory of Genetics and Bio- 
physics (ILGB) in Naples in 1962, with 
support from the Italian National Coun- 
cil for Nuclear Research. The ILGB paid 
higher salaries than Italian universities and 
attracted researchers from abroad. 

Meanwhile, Weisskopf at CERN con- 
sulted John Kendrew from the Laboratory 
of Molecular Biology (LMB) in Cambridge, 
UK, who had that year received a Nobel prize 
for his structure of the protein myoglobin. 
Kendrew immediately saw ‘CERB’ as a way 
to achieve a level of autonomy that was not 
available to him in Cambridge. He became 
its principal advocate and driving force. 

In September 1963, European molecular 
biologists met in Ravello, Italy. A power- 
ful group argued that rather than building 
a lab, a federal organization should foster 
interaction by providing fellowships to 
send scientists to laboratories elsewhere in 
Europe, and run regular practical courses
where they could learn new techniques such 
as phage genetics. Buzzati-Traverso sup- 
ported this proposal, fearing that a second 
international lab would threaten his ILGB. 

Ever the diplomat, Kendrew obtained 
unanimous votes both to work towards 
the creation of a lab and to set up a federal
organization. The new body would be called 
the European Molecular Biology Organiza- 
tion. Like an academy, it would elect mem- 
bers on merit. With three years of start-up 
funds from the Volkswagen Foundation, it 
was incorporated as a non-profit body in 
Switzerland on 12 July 1964. 


THE RIGHT DIRECTION 
The character and influence of EMBO owe
a great deal to its directors. The first was 
the British physicist and radiation biologist 
Raymond Appleyard. He established and ran 
EMBO’s fellowship scheme with minimal 
bureaucracy from his office in Brussels while 
formally employed by the European Atomic 
Energy Community (Euratom), a body for 
the peaceful use of nuclear technology. By 
the end of the 1960s, 14 countries had come 
together to fund EMBO’s activities: Aus- 
tria, Belgium, Denmark, West Germany, 
France, Greece, Israel, Italy, the Netherlands, 
Norway, Spain, Sweden, Switzerland and the 
United Kingdom. 

Kendrew finally secured the agreement of 
ten of the member states to fund a European
Molecular Biology Laboratory (EMBL). 
With him as its first director, EMBL opened 
in 1974 in Heidelberg, Germany. It is per-
haps not entirely coincidental that EMBL’s
location on the edge of a pleasant and his- 
toric university town bears many similarities 
to that of the LMB. 

EMBL’s achievements include the Nobel
prize awarded to Christiane Nüsslein-
Volhard and Eric Wieschaus in 1995 for their
work on early embryonic development. At 
times, it has been hard for outsiders to grasp 
the distinction between EMBO and EMBL. 
What is certain is that neither would have 
existed without the other. 


PUBLISHING AND ASILOMAR 

When EMBO, too, moved to Heidelberg 
in 1973, the British molecular biologist 
John Tooze took the helm. In 1982, Tooze 
established and began to edit The EMBO 
Journal, which he ran almost single-hand- 
edly until the end of his 20-year term. The 
journal promoted the EMBO name beyond 
Europe's borders, and provided a second 
income stream. It is now ranked nineteenth 
by impact factor of journals in cell biology 
and biochemistry. 

Tooze took over just as recombinant DNA 
technology was taking the field by storm. 
EMBO members, including Ken and Noreen
Murray at the University of Edinburgh,
UK, ran workshops to introduce European
scientists to the new techniques. One of
the Murrays’ early students was Paul Nurse,
who went on to win
a Nobel prize and is
the current president of the Royal Society in
London. “We got lots of hands-on experi- 
ence and also exposure to some of the great 
molecular geneticists of the time,” he wrote 
in 2004, in EMBO: 40 Years of Success. 

In February 1975, after US scientists 
raised fears about the possible dangers of 
the DNA technology, a conference in Asilo- 
mar, California, agreed a voluntary morato- 
rium on recombinant DNA research. Tooze 
told the US National Institutes of Health 
(NIH) that EMBO would be unable to rec- 
ommend that European researchers adopt 
the highly restrictive draft guidelines then 
under consideration, and the organization 
set up its own recombinant DNA commit- 
tee. With Ken Murray’s help, Tooze organ- 
ized an experiment to prove that viral DNA 
was much safer integrated into a bacterial 
plasmid than it was as part of an intact 
virus particle (M. Fried et al. Nature 279, 
811-816; 1979). 

As a result, the NIH held a workshop
with EMBO in the United Kingdom, and
subsequently withdrew its draft guidelines
that would have required all recombinant
research to be carried out in biosafety-
level-3 containment. “I think that was a turn-
ing point in the regulation of recombinant
DNA research in terms of its potential as a
biohazard,” says Tooze.


POLICY PLAYER 

Frank Gannon, a molecular biologist from 
University College, Galway, in Ireland, took 
a different approach when he took over in 
1994. “I saw EMBO as a way of permeating 
science throughout Europe with excellence, 
and of influencing the European Union who 
were becoming very strong at this stage,” 
says Gannon. To weld EMBO members and 
fellows into a community, he introduced 
annual workshops and launched an awards 
and mentoring scheme called the Young 
Investigator Programme. 

By 2000, the number of countries invest- 
ing in EMBO had more than doubled and 
included several Eastern European nations 
where science was poorly resourced. Many 
bright young fellows from those countries 
were making their careers overseas — wors- 
ening the state of science in their home 
nations. So EMBO set up ‘installation grants’,
with support from host countries, to enable 
returning fellows or researchers to start their 
own labs. The first countries to volunteer for 
the scheme were Croatia, the Czech Repub- 
lic, Estonia, Hungary, Poland, Portugal and 
Turkey. These schemes, The EMBO Journal, 
two new journals and two big policy pro- 
grammes to encourage engagement with 
society and international collaboration 
saw EMBO grow from 4 to 40 members of 
staff under Gannon. In 2001, it opened its 
own building, on land donated by EMBL in 
Heidelberg. 

Next, EMBO went into battle with the 
European Commission over its policy on 
grant-making for scientific research. The 
Framework Programmes of the European 
Union were ‘top-down’ funding mechanisms
geared towards economic impact. With no 
European money for bright ideas by individ- 
ual scientists, there was a growing demand 
in the scientific community for a European 
Research Council (ERC), modelled on the 
US National Science Foundation and any 
number of national research councils. 


BIGGER TENT 

EMBO took a lead in lobbying for this change,
and the ERC was founded in 2007. In its first 
five years it disbursed more than €4 bil- 
lion (US$5.4 billion) to 2,500 researchers in 
480 European institutions. Inevitably a few 
institutions in a few countries have received 
a disproportionate share. EMBO and the 
ERC cling to the principle that all awards 
should be based on merit alone, and as a
result contend with a chorus of complaints
from the countries that feel snubbed.

EMBO’s current director is the indefatiga-
ble Maria Leptin, a professor at the Institute
of Genetics at the University of Cologne in 
Germany, head of an EMBL lab and presi- 
dent of the lobby group Initiative for Science 
in Europe. Over the years, career adminis- 
trators at the tiller of the organization have 
given way to working scientists who under- 
stand the community. They leave the day- 
to-day running to a professional secretariat 
set up by Leptin’s predecessor, the German 
molecular biologist Hermann Bujard at the 
Centre for Molecular Biology at the Univer- 
sity of Heidelberg. 

The most difficult question that Leptin 
faces, perhaps more than for any of her pre- 
decessors, is what is EMBO for? Its original 
raison d’être, to catch up with the United
States in the techniques of molecular biology 
and to integrate Europe’s community in the 
field, has long since been achieved. Molecu- 
lar biology has entered the mainstream: few 
branches of biology can now progress with- 
out occasionally manipulating some DNA or 
solving a protein structure. 

EMBO has extended its 1,500-strong 
membership to new areas such as neuro- 
biology and ecology. Associate members 
can be of any nationality, and any scien- 
tist can apply for an EMBO fellowship to 
come to a European institution. Countries 
outside Europe, including South Africa, 
Taiwan and Singapore, have gained access 
to EMBO’s programmes through coop- 
eration agreements. No longer exclusively 
European nor exclusively molecular, in 
2012 EMBO stopped spelling out its name 
and adopted the brand ‘EMBO: excellence 
in life sciences’.

EMBO still has problems to solve, old 
and new. Because of language barriers, pen- 
sion structures and a host of other factors, 
it is more difficult for European scientists 
to move between countries than it is for US 
scientists to progress around the large number 
of excellent institutions in their home coun- 
try. EMBO also has to adapt to shifting career 
structures. For example, postdoctoral training 
has changed from a two-year stint to a five- 
year preparation for independence. EMBO is 
the first funding organization to have intro-
duced a ‘portable pension’ for its fellows, and
it supports the European Commission's slow 
progress towards a European Research Area. 

The scientific environment in Europe has 
changed out of all recognition since EMBO’s
founding 50 years ago. The geopolitical 
landscape still leaves the organization some 
mountains to climb.


Georgina Ferry is a science writer based 
in Oxford, UK. She is the author of EMBO 
in perspective: A half century in the life 
sciences. 


e-mail: mgf@georginaferry.com 
ILLUSTRATION BY DAVID PARKINS


Turning brain 
drain into brain 
circulation 


Overseas scholarships that encourage scientists to 
return to their home countries are helping to rebuild 
science in Latin America, says Torsten Wiesel. 


It takes a long time for a country to
build a strong base in science, but only
a short time to destroy it. Germany was
a sad example. It was a world leader in the
sciences for more than a century, until its
science base was demolished during the
Nazi era, and the country ceded its position
to the United States. It has taken decades for
Germany to rise again to its current level of
excellence.


The German experience has much in 
common with the situation in Latin Amer- 
ica, where authoritarian regimes came to 
power in the mid-twentieth century in coun- 
tries including Brazil, Chile and Argentina. 
As a consequence, many of the continent’s
best scientists emigrated to the United States, 
Europe and Canada. When the dictatorships 
were finally shaken off in the 1980s and 
1990s, the departed scientists were settled 
in their new homes and had little incentive
to return to countries left laden with debt. 

Many have forgotten that science in 
Latin America was once robust. For exam- 
ple, Bernardo Houssay, who won the 1947 
Nobel Prize in Physiology or Medicine, 
directed the Institute of Physiology at 
Buenos Aires University until 1943, when 
the government fired him for advocating 
for democracy; his protégé, Luis Leloir, won 
the 1970 Nobel Prize in Chemistry. Several 
emigrants also became laureates, includ- 
ing the immunologist Baruj Benacerraf, 
from Venezuela, and the biochemist 
César Milstein, from Argentina. 

Against this background, the Pew Latin 
American Fellows Program was founded to 
help to rebuild and strengthen biomedical 
sciences in the region. From its inception, 
the programme has been linked to the pre- 
existing Pew Biomedical Scholars Program, 
which each year provides around 20 promis- 
ing newly independent US scientists with 
four-year scholarships, funded by the Pew 
Charitable Trusts, a non-profit organization 
based in Philadelphia, Pennsylvania. 

In March 1989, at the annual meeting 
of the scholars programme in Puerto Val- 
larta, Mexico, a group of these scholars 
— struck by the lack of resources of their 
counterparts in Mexico — sought help from 
Rebecca Rimel, president of the Pew Chari- 
table Trusts. Later, Rebecca and I discussed 
the best ways to train talented students from 
Latin America, and our ideas crystallized 
into the fellows programme. 


REPATRIATION RATES 

Since the founding of the Pew Latin 
American Fellows Program in 1991, about 
ten graduate students each year have been 
awarded two-year postdoctoral fellowships 
to work in some of the best labs in North 
America. It is no surprise that some remain 
abroad to continue their careers in more 
developed countries. What is surprising is 
that more than 70% return to their home 
countries, which may not always allocate 
sufficient resources to cutting-edge research 
(see ‘Bringing science home’). For compari- 
son, the Human Frontier Science Program, 
a multinational initiative that supports the 
life sciences, also funds postdoctoral fellows 
worldwide — but fewer than half of those 
who train in the United States return to their 
home countries. 

Pew fellows who remain in North 
America have positions in leading uni- 
versities and several have established joint 
projects with labs in their home countries, 
as well as hosting new fellows. The annual 


SOUTH AMERICAN SCIENCE 


A Nature special issue 
nature.com/southamerica 
PEW LATIN AMERICAN FELLOWSHIP


Bringing science home 


Becoming a great scientist requires exposure to greatness. 

At a 1997 orientation meeting in Costa Rica for new postdocs, 
Torsten Wiesel, the co-founder of the Pew Latin American Fellows
Program, told us that the best scientists are not necessarily more 
creative or smarter than everyone else, but that they had the 
opportunity in their junior years to conduct and discuss science in 
prime environments. 

I earned my PhD in 1996 from the University of Chile in Santiago,
studying how ions move through proteins extracted from neurons.
I wanted to apply that work in living brains. Senior members in my
department told me about the Latin American fellows programme 
and helped me to find a postdoctoral adviser. 

Charles Zuker, then at the University of California, San Diego, 
accepted me into his lab and taught me to study how flies sense 
the world. It was an amazing experience to be in the Zuker lab when 
seminal work on taste and pressure receptors was happening. I
was part of the team that helped to show how the organization of
proteins in photoreceptor cells is essential for flies to see light. I
returned home to work as a junior professor at the University of 
Chile in 1998. 

Even now, few institutions in South America provide start-up 
funds to new faculty members. Most young professors have to join 
senior laboratories or sit in an empty lab, sometimes for more than a 
year, before getting their first grant. By contrast, I had a US$35,000
repatriation fund from my Pew fellowship. The money was enough 


to buy small, essential equipment to start doing some simple 
experiments soon after I returned: a table-top centrifuge to separate
cells into basic components, power supplies, electrophoresis 
chambers to run gels for DNA analysis, a mechanical shaker to grow 
bacteria and some reagents. 

Since then, I have trained nearly two dozen students to work with
flies and have helped four researchers to set up their own labs for fly
research in Chile. I have also directed three international courses to
train Latin American students to use the insects (and, more recently, 
worms) as animal models. 

And my relationship with Pew continues. I have started
collaborations with scientists from other countries whom I met
at annual Pew alumni meetings. For the past five years, I have
served on the regional Pew committee that selects six Chilean 
candidates for the fellowship. We look for young researchers who 
have connected with a great lab and proposed adventurous projects 
— particularly to work in areas or with animal models that are not 
available at home. The hope is that they will bring those skills back 
to their native countries. 

Chile has an 80% repatriation rate. That bespeaks both a good 
selection process and the importance of the start-up money for 
returning fellows. Scientific agencies and governments in Latin 
America should try to replicate these measures to help to build a 
stronger and more innovative scientific community.

Jimena Sierralta, University of Chile


meetings are attended by Latin American 
fellows, biomedical scholars and senior 
advisers, including Nobel laureates and 
Howard Hughes Medical Institute scholars. 
Participants share ideas and start collabora- 
tions as a result of the meetings. 


SUCCESSFUL SCHOLARS 

In a survey sent out in 2013 to 202 alumni
of the Latin American fellows programme 
between 1991 and 2011, an impressive 151 
responded. Alumni who have returned to 
their home countries include department 
heads and university provosts. Nearly half 
reported holding a director position, such 
as department chair or head of an academic 
discipline. On average, each fellow had 
published 15 papers, and those who had 
returned home had trained 13 scientists, 
from technicians and graduate students to 
visiting scholars. 

Last month, the journal Cell high- 
lighted a 2003 Pew fellow, immunologist 
Dario Zamboni, as one of 40 notable sci- 
entists under 40 years old. Zamboni is head 
of the Innate Immunity and Microbial 
Pathogenesis laboratory at the University 
of São Paulo in Brazil. His group is work-
ing out how the body responds to intracel-
lular parasites, including the one that causes 
Chagas disease — a problem in poor, rural 
areas of South America. Doing science in 
Brazil involves hurdles that would not exist 
in the United States, but he is determined 
to improve the system for other scientists
in the country. 

Selection of fellows starts with estab- 
lished researchers in Latin America. 
Argentina, Brazil, Chile and Mexico have 
national committees of former Pew fel- 
lows and senior scientists. Each committee 
selects six applicants by evaluating research 
proposals and interviewing a dozen or so of 
the most promising students. (The chairs 
of these committees act together as a fifth
multinational committee for applicants
from the other countries in the region.)

Thirty applications are chosen in
total to be evaluated
by a central committee of outstanding US
scientists with strong
ties to Latin America. Several are emigrants
from the dark periods in their countries of 
origin. These committees work hard to 
select the most promising scholars and send 
them to the best labs. 

The Brazilian state of São Paulo plans
to augment the benefits that are open to 
returning Pew fellows: they can apply for 
a generous four-year stipend to get their 
new labs off the ground. The hope is that 
other nations will use their own resources 
to extend this initiative to foster their best 
scientists. 
The absolute number of Latin American
fellows is small — fewer than 250 in a region
with more than 400 million people. But my 
impression is that they have an outsized 
influence, shaping expectations of what it 
means to be a scientist in Latin America, and
the fellows’ high expectations of themselves.

That said, the fellows programme is just a 
drop in the ocean relative to the need of the 
entire continent. This is perhaps especially 
true now that larger programmes exist in 
several Latin countries to support the train- 
ing of scientists abroad and to encourage 
trained scientists to return home, such as 
the Brazil Scientific Mobility Program (see 
page 207). 

Nonetheless, like a seed planted in a 
fertile soil, the Pew programme has flour- 
ished over the past 20 years. The plant will 
no doubt continue to grow and to sup- 
port its ecosystem. The ultimate success 
would be that this type of programme is 
no longer needed because each country 
would have developed strong, independent 
scientific establishments. But for now, we 
need to bolster the support for scientists in 
emerging countries, in Latin America and 
elsewhere.


Torsten Wiesel is president emeritus of 
Rockefeller University in New York City, 
USA. He won the 1981 Nobel Prize in 
Physiology or Medicine. 

e-mail: wiesel@rockefeller.edu 
Unregulated sales of medicines in developing countries contribute to the rise in antimicrobial resistance.


An intergovernmental panel 
on antimicrobial resistance 


Drug-resistant microbes are spreading. A coordinated, global effort is needed to keep 
drugs working and develop alternatives, say Mark Woolhouse and Jeremy Farrar. 


Last month, the World Health Organiza-
tion (WHO) produced a global map¹ of
antimicrobial resistance, warning that a
‘post-antibiotic’ world could soon become a
reality. In some ways, it already has.

Drugs that were once lifesavers are now
worthless. Chloramphenicol, once a physi-
cian’s first choice against typhoid, is no longer
effective in many parts of the world. Strains of
extensively drug-resistant tuberculosis (TB),
methicillin-resistant Staphylococcus aureus
(MRSA), multidrug-resistant Escherichia coli
and Klebsiella pneumoniae are serious threats
to public health. Plasmodium falciparum (the
parasite that causes the most dangerous form
of malaria) is developing resistance to all
known classes of antimalarial drug, threat- 
ening the remarkable progress that has been 
made against the disease. HIV is increasingly 
resistant to first-line antiviral drugs. Every 
class of antibiotic is increasingly compro- 
mised by resistance, as are many antivirals, 
antiparasitic and antifungal drugs. 

It could get worse: routine medical care, 
surgery, cancer treatment, organ trans- 
plants and industrialized agriculture would 
be impossible in their present form without 
antimicrobials. And the treatment of many 
infectious human and livestock diseases now 
relies on just one or two drugs. 
Resistance has spread around the world.
MRSA has spread between continents², as
have resistant strains of TB, malaria, HIV 
and pneumococci. Genes conferring resist-
ance to β-lactams — antibiotics used against
a broad range of infections, including E. coli 
and K. pneumoniae — have spread to bac- 
terial populations worldwide, probably 
originating in the Indian subcontinent³.
Numerous drug-resistant malaria strains 
have spread from southeast Asia to Africa. 

Antimicrobial resistance is a global prob- 
lem that requires global solutions. So far, 
the international response has been feeble. 
The WHO accepted only last month
that antimicrobial resistance might fall
within the remit of the International Health
Regulations⁴, which were implemented in
2007 to deal with events such as influenza 
pandemics. The regulations’ extension to 
antimicrobial resistance would oblige the 
196 signatory countries to carry out effec- 
tive surveillance and timely reporting for 
outbreaks of resistance. 

Better surveillance is essential. But it will 
not provide solutions; many calls to action 
on antimicrobial resistance have been made 
over the past 20 years, but there has been 
too little progress. The WHO missed the 
opportunity to provide leadership on what is 
urgently needed to really make a difference. 

What is required is committed and coordi- 
nated action on the root causes of resistance: 
the misuse of antimicrobials, the paucity 
of development of new drugs and the lack 
of alternatives. Guidelines must be imple- 
mented to improve the use of existing drugs; 
the scientific and business worlds need incen- 
tives and a better regulatory environment to 
develop new drugs and approaches, and those 
working in both the animal and human sec- 
tors need education and incentives to help 
them to change their ways. 

We call for the creation of an organization 
similar to the Intergovernmental Panel on 
Climate Change (IPCC) to marshal evidence 
and catalyse policy across governments and 
stakeholders. 


USE AND MISUSE 
Although all kinds of microbes evolve 
resistance, resistant bacteria are currently the 
greatest cause for concern. It is no coincidence 
that the nations with the strictest policies on 
antibiotic prescription (Scandinavian coun- 
tries and the Netherlands) have the lowest 
rates of resistance. But in most of the devel- 
oped world, clinical use of antibiotics has 
not declined, despite frequent calls to curtail 
overuse. In developing countries with rising 
incomes, consumption is surging; sales of 
even relatively expensive antibiotics increased 
fivefold in India and tripled in Egypt in 
2005-10 (see ‘A market for futility’). This 
growth is fuelled by unregulated, over-the-
counter sales of antimicrobials of all kinds.

In the United States, antibiotic usage in
humans is matched by that in farm animals, 
mainly as growth promoters. The Euro- 
pean Union banned the use of antibiotics as 
growth promoters in animals in 2006, but 
the situation is little better. As industrialized 
agriculture expands, notably in Asia, animal 
antibiotic usage will continue to grow.

Mitigating resistance will require coordina-
tion across sectors. Physicians, pharmacists,
veterinarians, patients and farmers all con- 
tribute to the overuse of antimicrobials. All 
have a part to play in using them more intel- 
ligently. However, changing practices in the 
hospital, clinic or farm is not easy. The onus 
A MARKET FOR FUTILITY


Antibiotic use is surging worldwide, especially 
in the developing world, where unregulated 
sales are soaring. 
is on countries that are major producers and
consumers of antimicrobials — especially 
the United States and European nations, and 
increasingly India and China — to introduce 
policies that promote best practice. 

Currently, national efforts are patchy and 
disconnected. The United Kingdom last year 
published a five-year strategic plan to com- 
bat resistance (see go.nature.com/ideq6t), 
although with no new money attached. 
Vietnam aims to combat resistance through 
its VINARES project⁵, but most countries
have no such programmes. The United 
States is still debating how to reduce the use 
of growth promoters in animals. Regional 
initiatives such as the European Antimicro- 
bial Resistance Surveillance Network are yet 
to be replicated elsewhere. Controls that do 
exist are often weakly implemented or are no 
more than voluntary guidelines. 


RESISTANCE IS NATURAL 
Most of the antibiotics in use today, from 
penicillin to carbapenems, originated in 
soil. Long before they were used as medi- 
cines, soil microbes were producing antibi- 
otics, and bacteria were evolving resistance 
to these natural compounds. This has been 
happening for perhaps billions of years⁶ on
a massive scale: there are at least 50 tonnes 
of bacteria for every person on the planet⁷.

Humans became involved with the manu- 
facture of antibiotics on an industrial scale 
only in the 1940s. Today, 20 tonnes of anti- 
biotics are produced every hour, contribut- 
ing to a global industry that is worth more 
than US$30 billion a year. We are now in a
race against evolution; new antimicrobials 
are deployed and, often within a few years, 
resistance develops. Factory-produced anti- 
biotics are presenting bacteria with a type 
of chemical attack that they have overcome 
many times before. 

Between 1983 and 1992, 30 new antibiot- 
ics were approved by the US Food and Drug 
Administration. From 2003 to 2012, the 
number was just seven. Why? Because there
are too few incentives and too many regula- 
tory barriers for the commercial sector to 
invest what is needed for the development 
of new antimicrobials⁸. Drug development is
risky, and antibiotics do not generate as much 
revenue as drugs for chronic conditions do. 
Drug companies find that research in other 
diseases is a better return on investment. 


A GLOBAL APPROACH

In many ways, antimicrobial resistance is 
similar to climate change. Both are processes 
operating on a global scale for which humans
are largely responsible. In antimicrobial resist- 
ance, as in climate change, the practices of one 
country affect many others. 

One key difference is that, for climate 
change, technologies exist to produce energy 
without burning fossil fuels, and investments 
and incentives will make them practical and 
affordable. Alternatives to antimicrobials 
— such as probiotics, prebiotics or phage 
therapy — are still, at best, experimental⁹.
More research on alternatives is urgently 
needed, coupled with efforts by industry, 
academia and governments to market them 
in a scalable way. 

There have also already been internation- 
ally agreed, evidence-based targets for cutting 
carbon dioxide emissions. There are no global 
targets for reducing antimicrobial use and no 
real understanding of how to set them. We do 
not even know what, if any, level of antimicro-
bial usage will be sustainable in the long term.

The threat of anthropogenic climate 
change led to the creation in 1988 of the 
IPCC. Despite its limitations, the panel is 
arguably the most successful attempt in 
history to empower scientific consensus to 
inform global policy and practice. 

Another useful precedent is the Montreal 
Protocol on Substances that Deplete the 
Ozone Layer, the first universally ratified 
treaty in the history of the United Nations. 
Faced with clear data that the ozone layer, 
which protects Earth from ultraviolet radia- 
tion, was under threat, governments agreed 
on a timetable to phase out ozone-depleting 
chemicals. The protocol, which came into 
force in 1989, is considered the most success- 
ful global environmental treaty, and has led to 
the shrinking of the ozone hole. 

We believe that similar global approaches 
should be attempted to address problems in 
public health. There is a need for a power- 
ful panel to marshal the data to inform and 
encourage implementation of policies that 
will forestall the loss of effective drugs to 
resistance, and to promote and facilitate the 
development of alternatives — a panel akin 
to the IPCC, and the analogous Intergov- 
ernmental Science-Policy Platform on Bio- 
diversity and Ecosystem Services founded in 
2012. An intergovernmental panel on anti- 
microbial resistance (IPAMR) must have 
the same firm foundation on the best
available science and potentially an even 
stronger mandate for action. 

From the outset, the IPAMR needs to 
avoid simply restating the problem. It must 
move rapidly to an agenda that includes 
identifying key knowledge gaps and how 
to fill them; assessing viable short- and 
long-term solutions; evaluating barriers 
to implementation; and setting out road 
maps for sustainable control of disease- 
causing microbes. It could, for example, 
support studies to investigate dosing 
regimes that stall resistance, coordinate 
incentives for developing new types of 
antimicrobial and set targets for prescrip- 
tions and animal use. 

To have any chance of achieving these 
objectives, the IPAMR must be trusted 
and free of vested interests. It will need to 
involve a broad range of experts, encom- 
passing clinical and veterinary medicine, 
epidemiology, microbiology, pharmacol- 
ogy, health economics, international law 
and social science. It will need technical, 
financial, industrial and political support 
from governments and agencies includ- 
ing the WHO, the World Organisation 
for Animal Health, the World Trade 
Organization and the United Nations, as 
well as from representatives of producers 
and consumers of antimicrobial drugs. 
Above all, it will need strong, independ- 
ent leadership. 

Creating an effective IPAMR will be 
a huge undertaking, but the successful 
global campaign to eradicate smallpox, 
led by the WHO, demonstrates that a 
coordinated, international response to 
a public-health threat can work. The 
attempt must be made — otherwise, the 
massive health gains made possible by 
antimicrobial drugs will be lost.

Bring microbial sequencing to hospitals


Analysing bacterial and viral DNA can help doctors to 
pick effective drugs quickly, says Sharon Peacock. 


A patient goes to her doctor with a cough and night sweats.
Rapid tests confirm the diagnosis of tuberculosis and hint
at multidrug resistance. But to suggest the optimum drug
combination, as many as eight weeks of laboratory testing
are required — a timescale dictated by the slow growth rate
of the pathogen (Mycobacterium tuberculosis). In the
meantime, the doctor must make an educated guess about
which medicines to prescribe, increasing the risk of
ineffective treatment and spread of infection.

Yet it would take less than a week to

A digital model of a nucleosome, drawn with the use of X-ray crystallography data.


STRUCTURAL BIOLOGY 


More than a 
crystallographer 


Researchers trained in X-ray crystallography are still in
demand, but must diversify their skill sets to be competitive. 


BY LAURA CASSIDAY 


Karolin Luger was bitten by the crystallography
bug during a biophysics lecture
in 1986. “One person gave a talk on
X-ray crystallography,” she recalls. “The lecture
was not that good, but the diffraction patterns
were so beautiful that I thought, ‘I really want to
learn how to do this.’” She learned. As a postdoc,
she was first author of a paper that reported the
crystal structure of a DNA-protein complex
called the nucleosome (see K. Luger et al. Nature
389, 251-260; 1997).

Now a Howard Hughes Medical Institute 
investigator at Colorado State University in Fort 
Collins, Luger still uses X-ray crystallography 
to study chromatin, the DNA-protein complex 
that packages genomes tightly inside cells. But 
like most in her field in recent years, she has 
expanded her toolkit to include other methods. 


Twenty years ago, many academic labs 
existed just for X-ray crystallography. Col- 
laborators would send in samples of their 
molecules of interest, and labs would crystal- 
lize them and solve their structures. Nowadays, 
labs are much more focused on specific sci- 
entific questions, and X-ray crystallography is 
just one of a suite of tools that they use. Tech- 
nology has improved so much that the proce- 
dure is usually no longer a full-time scientific 
pursuit. As ‘pure’ crystallography jobs dwindle, 
people who are trained in the technique must 
broaden their expertise to encompass skills 
such as protein expression and purification, 
biochemical assays and cell biology. 

In fact, many crystallographers now refer to 
themselves as structural biologists, reflecting 
the variety of techniques that they use to probe 
molecular structure. They may have PhDs in 
biophysics, biochemistry, bioinformatics or 
computational biology, and find work in aca- 
demia or industry. But they are united by a 
desire to ‘see’ the invisible molecules that make 
up cells. Those structures, often breathtaking 
in their beauty and intricacy, provide impor- 
tant clues about functions or sites that might 
serve as drug targets. 


CRYSTALLIZING THE HISTORY 

X-ray crystallography has been around for 
about a century, since scientists realized that 
atoms in a crystal could diffract X-rays, pro- 
ducing a pattern of spots on a detector. The 
angles and intensities of the diffracted beams 
reveal the structure of molecules. 

Until recent decades, only specialists with 
years of training and expensive equipment 
could perform X-ray crystallography. But in 
the 1990s, the technique became much more 
accessible. As synchrotrons — large, ring- 
shaped particle accelerators that produce 
powerful X-rays — spread across the globe, 
researchers could take or send their crystals 
to the synchrotron facilities, where resident 
experts guided them in collecting data and 
interpreting results. The automation of crys- 
tallization, improvements in methods for solv- 
ing structures and a boost in computing power 
greatly sped up the process, giving researchers 
time for other scientific pursuits. 

CRYSTALLOGRAPHY AT 100
A Nature special issue
nature.com/crystallography

Increased competition for research grants
also forced crystallography labs to become
more well rounded. Instead of just solving
one structure after another, researchers must 
now link the structure of a molecule to its func- 
tion through biochemistry and cell-biology 
experiments. “It’s no longer enough to conjec- 
ture about the function of a particular protein. 
You have to test it,” says Wayne Hendrickson, 
who specializes in biochemistry and molecu- 
lar biophysics at Columbia University in 
New York. 

The story of major crystallography projects 
such as the Protein Structure Initiative (PSI), 
supported by the US National Institute of Gen- 
eral Medical Sciences (NIGMS), encapsulates 
the evolution of the field. The PSI has solved 
more than 5,300 distinct protein structures 
and spurred innovations in crystallographic 
methods. Last year, however, NIGMS director 
Jon Lorsch, acting on the counsel of an advi- 
sory panel, decided that the project had run its 
course, and it will terminate on 30 June 2015 
(see Nature 503, 173-174; 2013). 

Critics argued that many of the structures 
that the PSI has solved have little relevance to 
important biological and medical problems, 
and that PSI scientists did not adequately 
poll the biological community to select inter- 
esting targets. In addition, such ‘big science’ 
programmes consume precious funds that, in 
the minds of some, would be better spent on 
individual researcher grants. 

Despite the PSI’s closure, Hendrickson, 
whose lab specializes in membrane proteins 
and was part of the initiative, says that it is too 
early to gauge the impact on crystallography 
job prospects. “It will depend on whether PSI 
centres like ours are able to gain alternative 
means of support to keep things going,” he 
says. His centre, the New York Consortium 
on Membrane Protein Structure, is applying 
to other research organizations and founda- 
tions for grants. 


TRIAL AND ERROR 

Crystallography work increasingly requires a 
good scientific question rather than just solv- 
ing structures — something Sheena D'Arcy 
knows well. As a graduate student, she worked 
in a crystallography-only lab. “For my postdoc,
I wanted a lab that was a bit more driven by
scientific questions,” she says. She is now working
with Luger, using crystallography — and
other methods — to study how DNA is pack- 
aged into chromatin. 

Early in her postdoc, D'Arcy recognized the 
value of approaching a problem with multiple 
techniques. She wanted to obtain a crystal 
structure of nucleosome assembly protein 1 
(Nap1), which helps to package DNA in the 
cell. But she could not get the protein complex 
to crystallize. And so, while still working on 
crystallization on the side, she tried an alter- 
native technique — hydrogen-deuterium 
exchange mass spectrometry. That provided 
important insights into the structure, and 
D’Arcy published a paper on it (S. D’Arcy et al.
Mol. Cell 51, 662-677; 2013). She says that
anyone who is interested in structural biol- 
ogy should consider learning this technique, 
as well as nuclear magnetic resonance (NMR) 
spectroscopy. 


FRESH APPROACHES 

Now that synchrotrons are widespread, crys- 
tallography labs no longer need their own 
expensive X-ray facilities. Luger’s lab does 
retain an X-ray generator for quickly screen- 
ing crystals and training students; the device is 
powerful enough to collect publication-quality
data from well-ordered crystals that diffract 
well, but non-ideal crystals or those that are 
quickly degraded by X-rays are sent to a syn- 
chrotron, says D’Arcy. The team has access to 
a beamline — a path of X-rays coming off the 
accelerator — at the Advanced Light Source 
synchrotron at Lawrence Berkeley National 
Laboratory in Berkeley, California. 

The crystallography purist who prefers not 
to dabble in other techniques might consider a 
career as a beamline scientist, loading crystals 
for researchers and overseeing them as they 
collect data. As well as permanent positions, 
many synchrotrons offer training programmes 
in crystallography. They also offer summer 
programmes and internships for students, 
postdocs and other researchers who want to 
learn the technique but lack their own X-ray 
facilities. 

The European Synchrotron Radiation Facil- 
ity (ESRF) in Grenoble, France, offers a six-week 
Summer Bachelor Programme for undergrad- 
uates, which includes lectures, tutorials, lab 
work and site visits. The Cheiron School at the 
SPring-8 synchrotron in Harima, Japan, has
ten-day training sessions for graduate students,
postdocs and young scientists who wish to
pursue careers in fields that involve
synchrotron radiation. And the Advanced
Photon Source in Argonne, Illinois, presents
an annual two-week National School on
Neutron and X-ray Scattering, in which
graduate students attend lectures and
tutorials and conduct short experiments.

Alexei Bosak began working at the ESRF
as a postdoc and is now a beamline scientist.
His duties are split between his own research
interests in materials science (he has beam
time reserved for his own experiments) and
the research of ESRF users. “The people come,
and we have to make them happy running the
experiments,” Bosak says. “Sometimes we are
less involved, and sometimes we are more
involved. But quite frequently a collaboration
results.”

Karolin Luger, a researcher in X-ray crystallography.


NEXT GENERATION 

Structural biologists are developing methods to 
expand the capabilities of conventional X-ray 
crystallography, with potential implications for 
future practitioners. In November 2013, the US 
National Science Foundation (NSF) awarded a 
US$25-million Science and Technology Center 
Grant to the University at Buffalo in New York 
and seven partner institutions to fund the 
BioXFEL research centre. The centre will fur- 
ther the use of recently developed tools called 
X-ray free-electron lasers (XFELs) that produce 
much shorter and more intense pulses of X-rays 
than synchrotrons (see page 604). 

According to Eaton Lattman, a struc- 
tural biologist at Buffalo and director of the 
BioXFEL, XFELs can analyse crystals that are 
1,000 times smaller than those required for 
conventional X-ray crystallography. “This 
opens up a whole new universe of protein mol- 
ecules for crystallography that we couldn't do 
before because we couldn't grow big enough 
crystals,” he says. The intense X-ray pulses 
can also capture frozen images of molecular 
motion, opening the door for dynamic studies 
and molecular movies. 

The BioXFEL centre will make use of an 
existing facility at the SLAC National Accel- 
erator Laboratory in Menlo Park, California, 
among other facilities. A smaller XFEL facility 
began operating in Harima, near the SPring-8 
synchrotron, in 2011. And a larger one is scheduled
to open in Hamburg, Germany, in 2015.

Lattman anticipates that the NSF grant will 
result in a “modest number” of new jobs at 
member institutions. “Right now, we're really 
limited by the amount of beam time that is 
available,” he says. “If we start to see more countries
around the world building XFEL facilities,
then I think we'll see growth in the field comparable
to what we saw for traditional crystallography
in the 1990s.” For now, the field of XFELs
needs technical improvements, such as better
data-processing software and specimen
delivery systems.


EXPERTS NEEDED 
Ironically, the very diversification in skills 
now required to obtain an academic job 
has arguably turned many structural biolo- 
gists into jacks of all trades, masters of none. 
Today’s researchers are accustomed to send- 
ing crystals to synchrotrons for analysis, and 
computer programs perform the analytical 
work. “To solve a straightforward struc- 
ture, you really don't have to understand 
the theory and the maths, and that’s a bit of 
a pity,” says Luger. “I’m a little worried that
we’re running out of people who know how
to handle problems or complex situations.”

Bosak notes that positions related to
crystallography are frequently available
at ESRF, and that they are hard to fill. “It’s
very difficult to find a good crystallographer
these days,” he says. Beamline scientists
must have a thorough understanding
of crystallography theory and instrumen- 
tation, skills that many modern training 
programmes do not emphasize. This means 
that a crystallographer with the right skill 
set can find that he or she is in demand. 
There is also a growing list of contract 
companies that specialize in crystallogra- 
phy. Firms such as Proteros Biostructures 
in Planegg, Germany; Shanghai Medicilon 
in China; and Emerald Bio in Bedford, 
Massachusetts, provide full-service crystal- 
lography to clients, many of which are phar- 
maceutical companies. The firms employ 
scientists at bachelor’s, master’s and PhD 
levels to carry out all steps of crystallogra- 
phy, from protein design to structural analy- 
sis. But pharmaceutical companies such as 
Merck, based in Whitehouse Station, New 
Jersey, and Novartis, based in Basel, Swit- 
zerland, still have their own crystallography 
programmes centred on structure-based 
rational drug design, which also employ 
scientists at all levels. These companies are 
potentially a better fit for those who wish 
to focus on a specific protein or biological 
process rather than a plethora of them. 
D’Arcy advises students with an interest
in X-ray crystallography to take the time to
learn its theoretical underpinnings and all
the techniques involved. “Don't let people
do things for you,” she says. “There are a
lot of senior people who know how to do 
things, and there’s always a time crunch to 
get data — you get crystals, and you just 
want to see the structure. Taking the time to 
sit down and teach yourself the theory and 
computer programs is going to pay in the 
long run — because you really learn when 
things go wrong.”


Laura Cassiday is a freelance writer based 
in Hudson, Colorado. 


TURNING POINT 


Nicholas Wright 


As a student, Nicholas Wright pursued 
interests in biology and public policy,
securing four degrees and a fellowship in the
department of government at the London 
School of Economics (LSE). He now uses his 
neuroscience training and insights into human 
decision-making to inform nuclear-security 
policy as a fellow at the Carnegie Endowment 
for International Peace in Washington DC. 


Did you always have dual interests? 

Yes. I went straight to medical school at Uni- 
versity College London (UCL), but I also did 
a year at Imperial College London studying 
health policy and management, which proved 
a turning point. While there, I did research in 
Chile on how best to incorporate scientific find- 
ings into clinical medicine. I learned that, to be 
effective, public policy must always take cultural 
and organizational factors into account; and I 
learned how best to ask questions so that they 
are relevant to public policy. 


How did you combine your interests? 

At the end of my medical degree, I went to a series 
of lectures by economist Richard Layard from 
the LSE, who talked about what neuroscience 
might be able to tell us about economic and social 
decision-making. I read up on neuroscience 
and decided to do a master’s degree. My 
research into functional magnetic resonance 
imaging (fMRI) dispelled the hypothesis that 
only one area of the brain specializes in reading. 
The technique surpassed my expectations and 
proved itself to be a new source of information 
that could be relevant to public policy. 


How did you delve into decision-making? 

It wasn’t by chance. After my postgraduate medical
exams, I did a PhD project to study how risk
perception influences decision-making, hoping 
to apply the concepts to issues of public policy. 
I worked with the Wellcome Trust Centre for 
Neuroimaging at UCL and stayed on as a fellow
doing fMRI after I finished my PhD. 


How did you position yourself for a policy job? 
During a year-long fellowship at the LSE, I 
built up my contacts, planned events with 
policy-makers and created a narrative about 
my experience. Several policy-oriented job 
opportunities in Washington DC came up, 
but a position at the Carnegie Endowment for 
International Peace was most exciting. 


What appealed to you about that post? 
There was a lot of great work done in the 1970s 
on applying decision-making and cognitive
psychology to nuclear strategy, but much less
had been done recently. The ideas coming out 
of neuroeconomics hadn't yet been applied to 
international relations, so there was enormous 
potential for doing interesting work that could 
have a positive impact on the world. 


Has your work had real-world impact? 

In January, a colleague and I published an article
called “The neuroscience guide to negotiations
with Iran” in The Atlantic. We combined
insights from neuroscience, behaviour and 
history to better understand Iranian motives 
in the ongoing nuclear talks. For example, 
conciliatory gestures are more effective when 
they're unexpected. Neuroimaging experi- 
ments detail how the brain computes the dif- 
ference between what is expected and what 
actually happens, and the more surprising 
the reward or punishment, the more impact 
it has on decision-making. Last year, Iranian 
President Hassan Rouhani unexpectedly used 
social media to engage on political issues, rais- 
ing hopes for a diplomatic breakthrough. We 
argued that neuroscience provides a new, 
important source of evidence relevant to 
nuclear talks with Iran. Our article was read 
by US and UK defence policy-makers, and I 
have been asked to continue providing briefs 
to the US Department of Defense. 


Do policy-makers value a science background? 
In the world of public policy, there are so many 
competing priorities that there is a limit to how 
much science can be used. Winston Churchill 
once said that scientists “should be on tap, but 
not on top”. Although science is not the only 
consideration, I am on tap to provide it.

NIH plans to enhance reproducibility


Francis S. Collins and Lawrence A. Tabak discuss 
initiatives that the US National Institutes of Health 
is exploring to restore the self-correcting nature of
preclinical research.


A growing chorus of concern, from
scientists and laypeople, contends
that the complex system for ensuring
the reproducibility of biomedical research
is failing and is in need of restructuring.
As leaders of the US National Institutes of 
Health (NIH), we share this concern and 
here explore some of the significant inter- 
ventions that we are planning. 

Science has long been regarded as ‘self- 
correcting’ given that it is founded on the 
replication of earlier work. Over the long 
term, that principle remains true. In the
shorter term, however, the checks and
balances that once ensured scientific fidelity 
have been hobbled. This has compromised 
the ability of today’s researchers to reproduce 
others’ findings. 

Let’s be clear: with rare exceptions, we 
have no evidence to suggest that irreproduc- 
ibility is caused by scientific misconduct. In 
2011, the Office of Research Integrity of the 
US Department of Health and Human Services
pursued only 12 such cases. Even if
this represents only a fraction of the actual 
problem, fraudulent papers are vastly
outnumbered by the hundreds of thousands
published each year in good faith. 

Instead, a complex array of other factors 
seems to have contributed to the lack of 
reproducibility. Factors include poor train- 
ing of researchers in experimental design; 
increased emphasis on making provocative 
statements rather than presenting technical 
details; and publications that do not report 
basic elements of experimental design.
Crucial experimental design elements that 
are all too frequently ignored include blind- 
ing, randomization, replication, sample-size 
calculation and the effect of sex differences. 
And some scientists reputedly use a ‘secret 
sauce’ to make their experiments work — 
and withhold details from publication or 
describe them only vaguely to retain a competitive
edge. What hope is there that other
scientists will be able to build on such work 
to further biomedical progress? 

Exacerbating this situation are the policies 
and attitudes of funding agencies, academic 
centres and scientific publishers. Fund- 
ing agencies often uncritically encourage 
the overvaluation of research published in 
high-profile journals. Some academic cen- 
tres also provide incentives for publications 
in such journals, including promotion and 
tenure, and in extreme circumstances, cash 
rewards.

Then there is the problem of what is 
not published. There are few venues for 
researchers to publish negative data or 
papers that point out scientific flaws in pre- 
viously published work. Further compound- 
ing the problem is the difficulty of accessing 
unpublished data — and the failure of fund- 
ing agencies to establish or enforce policies 
that insist on data access. 


PRECLINICAL PROBLEMS 

Reproducibility is potentially a problem in all 
scientific disciplines. However, human clini- 
cal trials seem to be less at risk because they 
are already governed by various regulations 
that stipulate rigorous design and independ- 
ent oversight — including randomization, 
blinding, power estimates, pre-registration 
of outcome measures in standardized, pub- 
lic databases such as ClinicalTrials.gov and 
oversight by institutional review boards and 
data safety monitoring boards. Furthermore, 
the clinical trials community has taken 
important steps towards adopting standard 
reporting elements.

Preclinical research, especially work that 
uses animal models, seems to be the area
that is currently most susceptible to repro- 
ducibility issues. Many of these failures have 
simple and practical explanations: different 
animal strains, different lab environments or 
subtle changes in protocol. Some irreproduc- 
ible reports are probably the result of coinci- 
dental findings that happen to reach statistical 
significance, coupled with publication bias. 




Another pitfall is overinterpretation of 
creative ‘hypothesis-generating’ experiments, 
which are designed to uncover new avenues 
of inquiry rather than to provide definitive 
proof for any single question. Still, there 
remains a troubling frequency of published 
reports that claim a significant result, but fail 
to be reproducible. 


PROPOSED NIH ACTIONS 

As a funding agency, the NIH is deeply 
concerned about this problem. Because 
poor training is probably responsible for 
at least some of the challenges, the NIH is 
developing a training module on enhanc- 
ing reproducibility and transparency of 
research findings, with an emphasis on 
good experimental design. This will be 
incorporated into the mandatory training 
on responsible conduct of research for NIH 
intramural postdoctoral fellows later this 
year. Informed by this pilot, final materials 
will be posted on the NIH website by the 
end of this year for broad dissemination, 
adoption or adaptation, on the basis of local 
institutional needs. 

Several of the NIH’s institutes and cen- 
tres are also testing the use of a checklist 
to ensure a more systematic evaluation of 
grant applications. Reviewers are reminded 
to check, for example, that appropriate 
experimental design features have been 
addressed, such as an analytical plan, plans 
for randomization, blinding and so on. A 
pilot was launched last year that we plan to 
complete by the end of this year to assess 
the value of assigning at least one reviewer 
on each panel the specific task of evaluat- 
ing the ‘scientific premise’ of the application: 
the key publications on which the applica- 
tion is based (which may or may not come 
from the applicant's own research efforts). 
This question will be particularly important 
when a potentially costly human clinical 
trial is proposed, based on animal-model 
results. If the antecedent work is question- 
able and the trial is particularly important, 
key preclinical studies may first need to be 
validated independently. 

Informed by feedback from these pilots, 
the NIH leadership will decide by the fourth 
quarter of this year which approaches to 
adopt agency-wide, which should remain 
specific to institutes and centres, and which 
to abandon. 

The NIH is also exploring ways to provide 
greater transparency of the data that are the 
basis of published manuscripts. As part of our 
Big Data initiative, the NIH has requested 
applications to develop a Data Discovery 
Index (DDI) to allow investigators to locate 
and access unpublished, primary data (see 
go.nature.com/rjjfoj). Should an investigator 
use these data in new work, the owner of the 
data set could be cited, thereby creating a new 
metric of scientific contribution unrelated
to journal publication, such as downloads
of the primary data set. If sufficiently meri- 
torious applications to develop the DDI are 
received, a funding award of up to three years 
in duration will be made by September 2014. 
Finally, in mid-December, the NIH launched 
an online forum called PubMed Commons 
(see go.nature.com/8m4pfp) for open dis- 
course about published articles. Authors can 
join and rate or contribute comments, and 
the system is being evaluated and refined in 
the coming months. More than 2,000 authors 
have joined to date, contributing more than 
700 comments. 


COMMUNITY RESPONSIBILITY 

Clearly, reproducibility is not a problem 
that the NIH can tackle alone. Conse- 
quently, we are reaching out broadly to the 
research community, scientific publishers, 
universities, industry, professional organi- 
zations, patient-advocacy groups and other 
stakeholders to take the steps necessary to 
reset the self-corrective process of scientific 
inquiry. Journals should be encouraged to 
devote more space to research conducted in 
an exemplary manner that reports negative 
findings, and should make room for papers 
that correct earlier work. 

We are pleased to see that some of the 
leading journals have begun to change 
their review practices. For example, Nature 
Publishing Group, the publishers of this 
journal, announced in May 2013 the following:
restrictions on the length of methods
sections have been abolished to ensure
the reporting of key methodological details; 
authors use a checklist to facilitate the veri- 
fication by editors and reviewers that criti- 
cal experimental design features have been 
incorporated into the report, and editors 
scrutinize the statistical treatment of the 
studies reported more thoroughly with the 
help of statisticians. Furthermore, authors 
are encouraged to provide more raw data to 
accompany their papers online. 

Similar requirements have been implemented
by the journals of the American
Association for the Advancement of
Science — Science Translational Medicine in
2013 and Science earlier this month — on
the basis of, in part, the efforts of the NIH’s
National Institute of Neurological Disorders
and Stroke to increase the transparency of
how work is conducted.

Perhaps the most vexed issue is the aca- 
demic incentive system. It currently over- 
emphasizes publishing in high-profile 
journals. No doubt worsened by current 
budgetary woes, this encourages rapid 
submission of research findings to the det- 
riment of careful replication. To address 
this, the NIH is contemplating modify- 
ing the format of its ‘biographical sketch’ 
form, which grant applicants are required 
to complete, to emphasize the significance
of advances resulting from work in which
the applicant participated, and to deline- 
ate the part played by the applicant. Other 
organizations such as the Howard Hughes 
Medical Institute have used this format and 
found it more revealing of actual contri- 
butions to science than the traditional list 
of unannotated publications. The NIH is 
also considering providing greater stability 
for investigators at certain, discrete career 
stages, utilizing grant mechanisms that
allow more flexibility and a longer period
than the current average of approximately
four years of support per project.

In addition, the NIH is examining
ways to anonymize the peer-review process
to reduce the effect of unconscious
bias (see go.nature.com/g5xr3c). Currently, 
the identifiers and accomplishments of 
all research participants are known to the 
reviewers. The committee will report its 
recommendations within 18 months. 

Efforts by the NIH alone will not be suf- 
ficient to effect real change in this unhealthy 
environment. University promotion and 
tenure committees must resist the tempta- 
tion to use arbitrary surrogates, such as the 
number of publications in journals with 
high impact factors, when evaluating an 
investigator's scientific contributions and 
future potential. 

The recent evidence showing the irre- 
producibility of significant numbers of 
biomedical-research publications demands 
immediate and substantive action. The NIH 
is firmly committed to making systematic 
changes that should reduce the frequency 
and severity of this problem — but success 
will come only with the full engagement of 
the entire biomedical-research enterprise.




Francis S. Collins is director and 
Lawrence A. Tabak is principal deputy 
director of the US National Institutes of 
Health, Bethesda, Maryland, USA. 
e-mail: lawrence.tabak@nih.gov

My life with Parkinson’s


A neuroscientist reflects on his experience of studying the circuits that 
control neural activity while his own brain began slowly failing him. 


Roughly a year ago, I found myself at
an elegant dinner party filled with
celebrities and the very wealthy. I am
a young professor at a major research univer- 
sity, and my wife and I were invited to mingle 
and chat with donors to the institution. To 
any outside observer, my career was ascend- 
ant. Having worked intensely and passion- 
ately at science for my entire adult life, I had 
secured my dream job directing an inde- 
pendent neuroscience research laboratory. 
I was talking to a businessman who had 
family members affected by a serious medi- 
cal condition. He turned to me and said: 
“You're a neuroscientist. What do you know 
about Parkinson's disease?” 


My gaze darted to catch the eyes of my 
wife, but she was involved in another con- 
versation. I was on my own, and I paused 
to gather my thoughts before responding. 
Because I had a secret. 

It was a secret that I hadn't yet told any of 
my colleagues: I have Parkinson's. 

I am still at the beginning of my fascinating,
frightening and ultimately life-affirming
journey as a brain scientist with a disabling 
disease of the brain. Already it has given me
a new perspective on my work, it has made
me appreciate life and it has allowed me to
see myself as someone who can make a difference
in ways that I never expected. But it
took a bit of time to get here.

NEW ANGLES ON THE BRAIN
A Nature special issue
www.nature.com/neuroscience2013


THE FIRST SIGNS 

I remember the first time I noticed that 
something was wrong. Four years ago, I was 
filling out a mountain of order forms for new 
lab equipment. After a few pages, my hand 
became a quaking lump of flesh and bone, 
locked uselessly in a tense rigor. A few days 
later, I noticed my walk was changing: rather 
than swinging my arm at my side, I held it 
in front of me rigidly, even grabbing the 
bottom edge of my shirt. I also had an
occasional twitch in the last two fingers of 
my hand. 

I was 36 years old and it was the most ter- 
rifying time of my life, even without any of 
these mysterious symptoms. In the span of 
six months, I found myself in a job that I had 
spent 20 years preparing for, I became a father
for the second time, I moved across the coun- 
try to a town where we knew no one, and I 
was working alone in an empty lab wondering 
who left me in charge. I study the way that 
neuromodulatory chemicals such as dopa- 
mine affect neural activity and behaviour. 
And now, my own brain chemistry was rebel- 
ling against me. 

I considered many possibilities. A brain 
tumour? Dystonia? Motor neurone disease? 
Huntington's disease? Multiple sclerosis? 
Was I just stressed out? 

My diagnosis came from a young neu- 
rology fellow at one of the world’s leading 
centres for the study of movement disorders. 
He felt more like a peer than an authority 
figure. He, too, spent a lot of time in the lab 
doing basic research and published papers in 
some of the same journals as me; we could 
have just as easily run into each other at a 
scientific meeting. As a result, the experience
of my diagnosis was oddly collegial. 

Right away, I wondered how long I could 
get away without telling my colleagues. I wor- 
ried that I would be less likely to get the grants 
I needed to run my lab if the reviewers were 
not confident about investing in my future. 
I wondered whether students and postdocs 
would be afraid to join my research group. 
And, perhaps most importantly, how long I 
would be able to do experiments — the thing 
that I most love. Stiffness, shaking, fatigue, 
jerky movements, falls, drooling, laboured 
speech and the expressionless Parkinsonian 
mask. These could all be a part of my future. 


MIND MATTERS 

I was diagnosed with Parkinson's more than 
two years ago. From that day, I have had a
different relationship with the brain — my 
scientific focus for the past 20 years. I now 
know what it is like to have a brain disorder 
and can explore its manifestations first hand. 
Take the very peculiar symptom known as 
‘freezing’. Occasionally, when I attempt to lift
my hand it well... won't. Notice that I didn't 
say can't. There is nothing wrong with my 
arm. It is still strong and capable of moving, 
but I have to put effort, even focus, into get- 
ting it to move — frequently to such a degree 
that I have to pause whatever else my brain is 
doing (including talking or thinking). Some- 
times, when no one else is around, I use my 
other hand to move it. 

As a neuroscientist, it is simultaneously 
fascinating and terrifying to be directly confronted
with the intersection of the neurophysiological
and philosophical constructs
of ‘will’. The way my mind and body do battle
forces me to reconsider the homunculus, a 
typically pejorative (among neuroscientists) 
caricature of a little man pulling levers inside
our heads, reading the input and dispatch- 
ing the output. Virtually all that we know 
about how the brain is organized belies this 
image, and yet there is a dualism to my daily 
experience. 
Parkinson’s, particularly in young people,
is primarily a disorder of motor control, not
of cognition. Still, my experience, however
limited, leads me to speculate about what
it is like to be trapped by a brain gone rogue.
When one begins to lose the ability to
interact with the world, and when one’s 
faculties for clear perception and cognition 
are stripped away, what remains of the con- 
scious self? 

This brings me to one of the main reasons 
that I have kept my disease secret: the stigma 
of ‘mental illness’. Because most people do not
understand Parkinson’s, it may be confused
with cognitive disorders such as schizophre- 
nia and Alzheimer’s disease. I feel as sharp 
and productive as ever, but I wasn’t sure 
that others would have faith in me at a time 
when my career is so fragile. So nearly every 
moment of my life became a performance, in 
which I tried to hide my symptoms. At work, 
at the grocery store, in my front yard, even in 
front of my kids — I am always keenly aware 
of my movements. And nowhere more so 
than at scientific conferences, such as at meet- 
ings of the Society for Neuroscience (SfN). 
You may not notice where my hands are, but 
I do. Often, I am sitting on them. 

Does Parkinson's affect the way I do 
science? It does affect the day-to-day 
mechanics of experiments for me. The tech- 
niques used in my lab require considerable 
motor skill at times. I have had to modify 
how I do some things, including taking more 
time, compensating with my good hand or 
using a different grip on instruments. Still, 
it is pretty remarkable how capable I remain 
at the bench. The lesson in this for me is that 
‘lab hands’ are more about experience, atten- 
tion to detail and adaptation of methods than 
they are about raw dexterity. And the low- 
to-moderate doses of drugs that I take really 
help, as do sleep and exercise. By all indica- 
tions, I will be able to continue research for 
many years, perhaps even indefinitely. 

There is also the question of how this 
diagnosis affects my scientific direction. I 
am sometimes asked whether I will wholly 
or partly switch to studying Parkinson’s dis- 
ease. I suppose I might if the right project 
came along, but in general I remain focused 
on the questions that I have already set 
out for myself. I am also sometimes asked 
whether my diagnosis makes me impatient
with the pace of discovery of cures. Here my 
answer is very clear. The dual perspectives of 
my condition and my position as an active 
researcher actually reinforce my belief in the 
importance of discovery science. I am keenly
aware that those cures are possible only in 
the wake of decades of basic research. Above 
all, my diagnosis makes me want to do the 
best and most exciting science I can, because 
the privilege could disappear for any of us in 
the blink of an eye.


TO TELL OR NOT TO TELL 

Back at the dinner party, all eyes were on me 
waiting to hear my thoughts on Parkinson’s. I
wanted to tell my colleagues what I was going 
through. I wanted to look at our donor and 
say: “Funny you should ask that. Not only 
am I a neuroscientist, but I also have Parkinson’s
disease.” I wanted to launch into an
eloquent monologue that put a personal face 
on the science of neurodegenerative disease. 
I wanted to conclude by saying, “And that is 
why basic brain research is so important.” 

But I didn’t. 

Instead, I dispassionately described the 
pathology and characteristic symptoms of 
Parkinson's. It was an intellectually engag- 
ing exchange, but it wasn’t the conversation 
it could have been. This is one of the main 
reasons I decided to stop hiding. 

Earlier this year, I told my department 
chair. Over the next few days, I told the 
administration, my lab and many of my col- 
leagues. It took a lot out of me, but it ended up 
being one of the best decisions I ever made. 
Everyone at work was so supportive — I felt 
silly for having spent four years, since the 
onset of my symptoms, worrying about how 
they would react. In the subsequent months, 
it has become a non-issue for me in how I 
interact at work. Everyone treats me like any 
other colleague, and it is such a relief not to 
worry about who knows anymore. It is still 
uncommon for me to tell someone new, but I 
do not do anything to hide my condition, just 
enough to not call attention to it. For anyone 
reading this who is going through something 
similar, I am here to tell you that life is too 
short to run from who you are. Your col- 
leagues might surprise you, and you can still 
be a great scientist despite a disability.

So why am I writing this piece anony- 
mously? Because I don't want to be known 
to the scientific community as ‘Parkinson's 
guy’ before I am known as a scientist. That
said, I'm not hiding any more, so if you care 
you can dig enough to find out who I am. 
I’m okay with that.


The author is a neuroscience professor 
at a major US university. He blogs at 
parklifensci.blogspot.com and tweets at 
@Parklifensci. 

e-mail: parklifensci@gmail.com 


